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ABSTRACT 


An  empirical  study  of  the  time-sharing  system  CP/67  is  reported 
in  this  thesis.  Certain  selected  system  and  user  parameters  are 
studied  to  obtain  system  performance  measures  and  an  indication  of  the 
general  behaviour  of  the  users.  A  unique  method  for  gathering  data  is 
given.  The  method  involves  making  no  change  to  the  system  internally 
but  uses  a  virtual  machine  to  obtain  the  required  data. 

The  singularly  important  result  obtained  is  the  exponential 
relationship  between  the  number  of  users  and  the  paging  load.  The 
behaviour  of  the  APL  virtual  machine  in  this  time-sharing  environment 
is  studied  in  detail.  The  APL  virtual  machine  is  found  to  be  demanding 
an  extremely  large  amount  of  main  storage.  It  is  also  found  to  be  an 
"I/O  bound"  user. 

The  success  of  the  "data-gather ing"  method  indicates  that  a 
more  detailed  study  of  this  sytem  should  be  carried  out.  Suggestions 
for  such  studies  are  included  in  the  thesis. 
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CHAPTER  I 


INTRODUCTION 

Computer  systems  that  are  able  to  serve  a  single  user  in  a  con- 
versional  or  interactive  manner  have  been  in  existence  for  relatively 
many  years.  Lately,  for  economic  and  other  reasons,  interactive  computer 
systems  have  been  implemented  to  serve  many  users  "simultaneously." 

Directing  attention  to  information  processing  techniques,  such  as 
data  reduction,  simulation  models,  casual  "desk-calculator"  computation, 
on-line  experimentation,  programming  language  translation,  and  data 
retrieval,  it  can  be  noticed  that  each  of  these  processes  puts  its  own 
specific  requirements  upon  the  hardware,  software,  and  interface  charac¬ 
teristics  of  the  system  in  the  sense  of  minimizing  on  cost,  data  and 
computation  rates,  storage  sizes,  and  interface  modes  and  languages.  On¬ 
line  experimentation,  regardless  of  its  specific  nature,  for  example, 
implies  sufficiently  high  data-rate  sensors  and  internal  processing  speed 
to  keep  pace  with  the  experiment,  with  appropriate  display  and  input 
facilities  for  the  user  both  to  monitor  and  direct  the  course  of  the 
experiment.  Data  reduction  and  program  language  translation  imply  the 
ability  to  handle  the  input,  output  and  internal  processing.  The  rates 
and  costs  are  determined  by  what  the  user  is  willing  to  accept.  Sim¬ 
ulation  can  cover  a  broad  class  of  phenomena,  some  of  which  can  place 
inordinate  demands  upon  central  processing  speeds  and  costs.  Data 
retrieval  applications  require  a  large  amount  of  storage,  ready  access. 
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and  comprehensible  display  of  the  data;  while  a  personalized  "desk- 
calculator"  requires  a  minimum  of  an  appropriate  keyboard  and  printer  with 
an  easily  usable  programming  language. 

For  any  application,  these  processes  are  preceded  by  a  period  of 
program  development  and  checkout,  which  may  well  require  sizable  amounts 
of  computation.  Thus,  applications  require  the  availability  of  low-cost 
computation  capacity,  which  will  handle  both  the  debugging  of  the  soft¬ 
ware  and  the  production  runs . 

Sec.  1.1.  Time-sharing  Systems 

The  following  three  major  types  of  computer  systems  are  available. 
First  there  is  the  system  which  is  usually  "small"  and  relatively  "inex¬ 
pensive"  and  is  operated  by  a  single  "on-line"  user;  that  is,  the  computer 
is  used  through  its  manual  input-output  interface.  Second,  there  is  the 
"large,"  centralized  "computer  center"  in  which  the  user's  programs  are 
in  batches  and  several  may  be  executed  at  the  same  time  by  a  multipro¬ 
gramming  operating  system.  Finally,  there  is  the  on-line  "remote  access" 
system  with  a  distributed  set  of  input-output  terminals  that  can  be 
connected  to  a  centralized  computing  facility  via  public  or  private 
communication  lines.  To  a  user,  such  a  system  can  be  "on-line'  in  the 
sense  that  the  processing  equipment  is  capable  of  directly  servicing  his 
requests.  This  last  category  is  usually  called  "time-sharing  computing," 
since  the  processing  techniques  require  time  multiplexing  of  the  users' 
requests . 


At  any  given  time  in  the  operation  of  a  time-sharing  system,  some 
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portion  of  the  interactive  users  may  require  particular  programs  to  be 
executed.  The  users'  requests  are  selected  in  some  given  order,  and  each 
request  is  executed  for  a  given  time  interval,  not  necessarily  to  comple¬ 
tion.  Typically,  a  particular  user's  program  will  be  allowed  to  use  a 
processing  unit  for  a  period  of  time,  will  be  stopped  so  that  another 
user's  program  can  run,  and  at  some  later  time  will  be  continued  from 
the  point  where  it  was  stopped.  At  the  point  in  time  when  one  user's 
program  is  stopped  and  another's  resumed,  the  status  of  the  former  must 
be  saved  and  that  of  the  latter  restored,  in  order  to  continue  a  program 
from  where  it  was  discontinued  previously. 

This  relatively  new  approach  requires  some  justification  and 
hence  systematic  analysis  is  necessary. 


Sec.  1.2.  Analysis  and  Measurements 

Measurements  made  on  time-shared  systems  have  the  following 

purposes : 

1.  to  make  a  choice  between  one  system  and  another,  using  the 
different  possible  configurations  as  decision  criterion 

2.  to  deepen  the  understanding  of  a  system  so  as  to  optimize  a 
given  system  or  to  affect  the  design  of  other  systems. 

Scherr  (23),  at  the  conclusion  of  his  monograph  states  the 
following : 

With  the  advent  of  new  time-sharing  techniques  and  systems,  much 
more  analysis  and  measurements  must  be  accomplished  if  these  systems 
are  to  be  designed  and  used  intelligently.  It  is  clear  that  no  small 
effort  should  be  expended  in  order  to  monitor  and  measure  the  per¬ 
formance  and  use  of  any  operating  time-shared  system.  It  is  through 
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intelligent  use  of  this  type  of  data  that  improvements  can  be  made 
to  present  systems  and  that  proposed  systems  can  be  evaluated. 

The  following  three  approaches  are  open  to  an  analyst  faced  with 
evaluation  of  system’s  performance  of  a  time-sharing  system. 

1 •  Mathematical  Modelling.  This  involves  modelling  of  time  and  space 
scheduling  algorithms  of  the  system.  In  most  cases  it  is  constrained  to 
more  or  less  simplified  queueing  models  of  these  algorithms.  This  almost 
inevitable  simplification  of  the  physical  system,  so  that  for  example 
"Markov-type"  analysis  can  be  applied,  is  one  of  the  major  drawbacks  to 
an  analyst  using  the  mathematical  approach.  In  spite  of  the  above- 
mentioned  difficulty,  this  is  still  the  most  widely  used  method  of 
analysing  time-sharing  systems. 

2.  Simulation  Models .  Simulation  models  are  very  useful  since  the  level 
of  detail  necessary  in  handling  some  of  the  features  of  time-sharing 
systems,  are  beyond  the  scope  of  mathematically  tractable  models.  Markov 
models  cannot,  in  general,  be  used  to  represent  processes  where  other 
than  random  queueing  is  used.  Queueing  models  are  not  applicable  for 
processes  where  the  arrival  rates  of  service  requests  are  a  function  of 
the  service  rate.  This  kind  of  problem  can  be  easily  overcome  by 
simulation  studies.  They  can  be  used  in  future  prediction  of  the  system 
performance  by  varying  certain  parameters  like  the  size  of  storage  and 
the  speed  of  data  transfer  through  a  channel  device.  The  detracting 
factor  of  a  simulation  model  may  be  that  it  requires  a  large  amount  of 
care  and  computing  time  in  order  to  incorporate  the  details  necessary  to 
represent  a  complex  time-sharing  system  accurately. 
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3.  Statistic  Gathering.  The  method,  in  this  final  approach  available 
to  an  analyst,  and  perhaps  the  closest  to  the  real  situation,  involves 
gathering  data  on  certain  selected  system  parameters  and  user  character¬ 
istics.  We  will  discuss  this  last  method  and  consider  its  advantages  in 
the  next  section. 

Sec.  1.3.  Statistical  Approach 

The  last  of  the  three  methods  discussed  in  the  preceding  section 
has  some  distinct  advantages  over  the  other  two  methods.  Comparing  it 
with  mathematical  modelling  and  simulation  it  can  be  noticed  immediately 
that  it  is  not  constrained  with  any  of  the  necessary  and  simplifying 
assumptions  binding  the  other  two  techniques.  There  are  no  a  priori 
assumptions  made.  The  method  of  "stat-gathering"  allows  us  to  arrive 
at  conclusions  based  solely  on  the  performance  of  the  system  as  it 
behaved  at  the  time  of  gathering  this  information.  For  example,  the 
persistent  assumption  of  Poisson  arrival  rate,  made  in  most  of  the 
mathematical  queueing  models,  is  discarded.  Instead,  the  actual  "arrival 
times"  are  obtained  through  direct  observation,  which  could  then  be  used 
as  the  input  to  the  other  two  models.  Thus,  the  statistical  approach  is 
easier  and  more  reliable  than  the  mathematical  and  simulation  approach. 

A  stat-gathering  system  or  model  should  be  such  that  it  does  not 
require  too  much  storage  space  and  processing  time  because  this  would 
imply  biasing  the  data  on  the  use  of  the  system  by  the  analyst’s  compu¬ 
tations.  The  analyst  should  make  full  use  of  the  fast-access  storage 
available  to  him  as  a  user  of  the  time-sharing  system  under  study. 
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The  problem  of  data-reduction  can  also  be  a  very  detracting  factor 
for  the  analyst  using  stat-gathering  methods.  This  could  be  avoided 
effectively  by  carefully  choosing  the  parameters  to  observe.  The  aim 
should  be  to  obtain  maximum  information  from  minimum  data.  This  involves 
a  great  amount  of  analysis  completed  prior  to  actually  commencing  with 
the  data-gathering.  This  analysis  could  involve  decisions  like,  how 
often  should  the  observations  be  taken.  The  balance  between  the  quantity 
of  data  and  its  usefulness  should  always  be  kept  in  mind.  Gathering 
data  for  little  or  no  additional  information  should  be  avoided.  This 
would  only  put  an  unnecessary  load  on  the  time-sharing  system  and  would 
probably  be  detrimental  to  its  performance. 

These  three  approaches  are  discussed  in  greater  detail  in  the 
next  chapter  pointing  out  the  amount  of  research  carried  out  in  modelling 
time-sharing  systems. 


CHAPTER  II 


REVIEW  OF  LITERATURE 

In  the  late  fifties  the  idea  of  "time-sharing"  was  first  proposed 
by  a  group  of  computer-oriented  scientists  at  M.I.T.  and  since  it 
materialized  in  Project  MAC  and  other  implementations ,  research  in  such 
systems  has  been  growing  steadily.  Research  has  been  carried  out 
extensively  in  the  three  main  approaches  of  analyzing  such  systems. 

In  the  next  three  sections,  a  review  of  the  mathematical  approach, 
the  approach  through  simulation,  and  the  empirical  data- gathering 
(measuring)  approach,  respectively,  is  given. 

Sec.  2.1.  Mathematical  Approach 

Before  proceeding  with  a  review  of  the  mathematical  modelling  of 
time-sharing  system,  some  definitions  are  necessary.  The  definitions  in 
this  section  pertain  to  the  papers  reviewed. 

Time-sharing  systems  work  in  the  following  manner.  Incoming  tasks 
are  queued  and  scheduled  for  service  according  to  a  particular  scheduling 
policy.  At  its  scheduled  "service-time,"  each  job  is  processed  for  a 
time  period  called  a  "quantum."  If  during  this  quantum  the  task  is  com¬ 
pleted,  that  task  departs  and  service  begins  on  the  next;  otherwise,  the 
uncompleted  task  rejoins  a  queue  to  await  further  service.  The  time  the 
computer  spends  on  scheduling,  allocating,  buffering,  and  controlling 
terminal  input  and  output  represents  a  slice  of  processing  time  that  may 
be  called  "overhead."  If  not  all  the  active  parts  of  user-programs  fit 
into  main  storage,  processing  time  may  be  lost  while  awaiting  the 
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interchange  of  the  task  just  completing  its  quantum  of  service  and  the 
task  that  is  next  scheduled  to  run.  This  is  usually  referred  to  as  the 
"swapping"  time  in  the  literature. 

Mathematical  models  of  time-sharing  systems  quite  naturally  are 
stochastic  and  their  analysis  thus  draws  heavily  from  queueing  theory. 

A  number  of  such  models  now  exist;  in  general  these  differ  only  in  the 
choices  for  the  following  factors: 

1.  The  number  of  input  queues  can  either  be  finite  or  can  be 
considered  to  tend  to  infinity. 

2.  The  number  of  central  processors  considered  has  not  exceeded 
two. 

3.  The  type  of  arrival  processes  used  are  Bernoulli,  single 
Poisson  stream,  or  multiple  Poisson  streams. 

1*.  The  type  of  service  processes  applied  have  been  Geometric, 
Exponential,  Erlang,  or  general. 

5.  The  swapping  time,  say  T  ,  and  the  overhead  time,  say  T  ,  are 
either  both  considered  to  be  zero  or  their  sum,  T  +  T  ,  is 

b  U 

considered  to  be  a  constant  or  a  random  variable. 

6.  The  quantum  length  is  either  finite  or  tends  to  zero. 

7.  Some  of  the  service  disciplines  used  are  the  round-robin,  the 
first-come-first-served  to  completion,  and  priority  systems. 

The  main  performance  measurements  considered  are  the  following: 

1.  Average  response  time — the  average  time  between  the  entering 
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of  a  request  at  the  terminal  and  the  acknowledgement  that 
the  request  has  been  completed. 

2.  Average  waiting  time  in  the  queue — The  average  time  a 
request  spends  in  the  queue  before  receiving  service. 

3.  Average  queue  length — The  average  number  of  requests  in  a  queue. 

A  model  of  a  time-sharing  system  with  one  processor,  a  finite 
number  of  channels,  and  round-robin  scheduling  (Fig.  2.1)  has  been 
studied  extensively  by  Adiri  and  Avi-Itzhak  (l) ,  Greenberger  (15) , 

Scherr  (23),  Coffman  and  Krishnamoorthi  (6),  Krishnamoorthi  and  Wood  (l8), 
Patel  (22),  Krishnamoorthi  (19),  and  Coffman  (T). 

Infinite  input  source  model  analysis  was  initiated  by  Kleinrock 

(16)  ,  who  considered  a  round-robin  scheduling  policy  again.  Kleinrock 

(17) ,  in  a  later  paper  also  investigated  the  processor-shared  model  in 

which  there  are  P  priority  groups.  The  input  to  the  p^1  priority  group 

is  Poisson,  with  average  rate  Ap,  and  each  arrival  to  the  p"^  group  has 

an  exponentially  distributed  service  requirement  with  a  mean  of  l/up 

operations  where  up  is  the  service  rate  and  p  =  1,  2,  ...  ,  P.  A  member 

of  the  p^  group  is  given  g  Q  seconds  of  service  each  time  he  reenters 

P 

the  queue  where  Q  is  the  fixed  quantum  length  and  g  a  weight  factor  for 

P 

_i_  Vi 

the  p  group.  Kleinrock  in  this  paper  studied  the  effect  of  the  priority 
assignment  on  the  total  time  a  request  spent  in  the  system.  Ke  found  that 
for  each  priority  group  there  exists  a  critical  value  of  the  required 
service  time  below  which  the  wait  time  in  the  RR  ( round- robin)  system  was 
less  than  in  the  "first-come- first-served"  to  completion  system. 
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Others  who  have  studied  the  infinite  source  RR  system  are 
Chang  (4,  5),  Coffman  (8),  and  Shemer  (27). 

Foreground-background  systems  (Fig.  2.2)  have  been  proposed  and 

studied  by  Corbato  (10)  ,  Coffman  (7),  Coffman  and  Kleinrock  (9), 

Shemer  (27),  and  Schrage  (24).  Studies  have  been  made  of  general  FB^T 

N 

models  and  also  FB  models. 

00 

Another  major  scheduling  policy  to  note  is  the  "shortest 
remaining  processing  time"  scheduling  discipline  which  has  been  studied 
by  Schrage  and  Miller  (25)  and  Schrage  (26). 

As  mentioned  in  Chapter  I,  all  these  papers  make  some  assumptions 
on  arrival  and  service  time  distributions  to  use  queueing  theory 
techniques.  This  is  a  drawback,  but  this  type  of  analysis  should  provide 
the  system  designer  with  a  number  of  degrees  of  latitude  with  which  to 
synthesize  a  time-shared  processing  system. 

Sec.  2.2.  Simulation  Models 

Five  of  the  major  simulation  efforts  of  time-sharing  systems  are 
given  below: 

1.  Scherr  (23)  has  simulated  the  M.I.T.  Project  MAC's  Compatible 
Time-Shared  System  (CTSS).  He  considers  the  following 
simulation  models;  CTSS,  CTSS  with  RR  scheduling  policy 
instead  of  the  existing  FB9  policy  at  Project  MAC,  and  CTSS 
with  multiprogramming  to  provide  overlapped  processing  and 


swapping; 


* 
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2.  Fine  and  Mclsaac  (ik)  have  simulated  a  system  similar  to  the 
System  Development  Corporation  (S.D.C.)  Time-Sharing  System. 
They  consider  a  round- robin  system  as  well  as  a  two-queue, 

FB^,  scheduling  policy; 

3.  Fife  and  Rosenberg  (13)  have  simulated  a  system  in  which 
the  central  notion  is  one  of  memory-sharing  with  fixed 
memory  partitioning.  The  main  core  is  divided  into  five 
memory  blocks.  Each  user  is  confined  to  remain  within  a 
single  block.  Jobs  queue  up  for  an  allocation  to  core 
memory,  and  once  in  core,  remain  there  until  the  job  is 
completed  or  until  the  program  halts ,  awaiting  an  operator 
message  (or  until  an  illegal  command  is  generated). 

Processing  is  transferred  to  a  new  job  in  core  when  the 
current  program  being  run  fills  its  output  buffer; 

k.  Lavita  (20)  has  simulated  a  FCFS  run- to- completion  system 
with  emphasis  on  the  comparison  of  "response  time"  between 
a  drum  system  and  a  disk  system; 

5.  Neilson  (21)  has  simulated  Stanford's  360/67  time-sharing 

system.  He  varied  the  basic  hardware  (e.g.  1  or  2  processors), 
speed  of  data  transfer  to  and  from  storage  devices  (e.g.  disks) 
and  obtained  an  optimum  configuration  of  the  desired  system. 
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He  also  found  that  the  capabilities  of  disks  are  far  behind 
the  capabilities  of  the  rest  of  the  system,  considering  the 
delay  caused  by  disk  I/O.  He  also  investigated  the  difference 
between  "paging”  from  disk  and  drum. 

Sec.  2.3.  Measurement  Studies 

There  have  been  four  major  measurement  studies  of  actual  time¬ 
sharing  systems. 

1.  Scherr  (23)  has  made  measurements  of  the  behaviour  and  system 
performance  of  the  M.I.T.  Project  MAC's  CTSS  (mentioned  in 
Sec.  2.2.).  During  the  period  of  his  measurements,  CTSS 
served  approximately  250  users  with  an  IBM  709^  augmented  by 
a  7320  drum  and  a  130/-2  disk  storage  device.  He  obtained 
probability  distributions  of  interarrival  times  (think  time), 
service  time,  program  size,  and  investigated  relations  between 

response  time/processor  time  and  number  of  users;  computer 
utilization  and  number  of  users;  and  utilization  of  swap  storage 
and  number  of  users. 

2.  Totschek  (29)  reports  measurements  made  on  the  S.D.C.  Time- 
Sharing  System  which  can  handle  up  to  35  users  at  remote  con¬ 
soles.  He  obtained  the  probability  distributions  of  think 
time,  service  time,  program  size,  and  number  of  users.  Totschek 
also  investigated  (like  Scherr)  the  relationship  between 
(response  time/processor  time)  and  number  of  users,  and  tape 
usage  and  number  of  users. 

3.  Sutherland  (28)  has  made  measurements  on  the  Lawrence  Radiation 


/ 


Laboratory  Time-Shared  System  which  is  implemented  on  the 
GDC  6600.  The  system  handles  ^8  teletype-equipped  consoles 
which  serve  110  users.  He  obtained  relations  between  the 
assigned  priority  and  program  size;  percentage  of  time  spent 
on  user  programs  and  day,  user,  idle,  and  disk  time  and  day, 
and  finally,  memory  map  and  program  size. 

J*.  Cantrell  (3)  has  reported  his  measurement  of  the  GE- 

Dartmonth  Time-Shared  System.  This  system  handles  200  users 
at  remote  terminals.  He  obtained  the  probability  distri¬ 
butions  of  service  time,  program  size  and  disk  requests.  He 
also  investigated  the  relationship  between  breakdown  of 
computer  utilization  and  processor  time. 

Of  primary  interest  with  respect  to  user  characteristics  is  the 
distribution  of  think  times.  Scherr  finds  a  fair  degree  of  agreement 
between  simulation  and  actual  measurements,  and  it  is  interesting  to 
note  that  both  yield  an  average  think  time  of  approximately  35  seconds. 

The  other  major  user  characteristic  is  the  distribution  of  service 
time  (swap  time  excluded).  Totschek  comments  that  the  log-normal  distri¬ 
bution  fits  both  service  time  and  think  time  reasonably  well.  Cantrell 
observes  that  90 %  of  all  service  times  represent  just  slightly  more  than 
10 %  of  the  system  load  (as  far  as  processor  time  is  concerned)  and, 
therefore,  10%  accounts  for  almost  90%  of  the  load. 


The  distribution  of  program  sizes  provides  little  agreement  among 


> 


15 


the  above-listed  sources.  Scherr  reports  an  average  program  size  of 
6.3  x  10  words.  It  is  interesting  to  note  at  this  point  that  all  con¬ 
clusions  drawn  from  results  obtained  are  based  on  "average"  values.  This 
could  be  a  major  setback. 

The  results  obtained  by  Scherr  for  the  ratio  of  response  time  to 
processor  time  as  a  function  of  mean  number  of  interactive  users  for  the 
CTSS  measured  data,  the  CTSS  simulation  and  the  RR  simulation  all  agree 
remarkably  well.  Furthermore,  they  all  agree  with  his  mathematical,  RR 
processor-shared  model!  Totschek’s  curves  are  similar  in  shape  to  that 
of  Scherr* s,  but  an  absolute  comparison  is  difficult  to  make  because 
Totschek  includes  I/O  time  in  the  response  time. 

The  other  measurements  concerned  with  system  performance  are 
widely  dispersed  in  character  and  of  a  probing  nature  rather  than  strong¬ 
ly  relevant  to  some  model.  They  reflect  the  observations  of  Estrin,  et  al. 
(ll)  that  present  methods  of  measurements  do  not  allow  sufficient  freedom 
in  design  of  experiments  on  complex  systems.  Moreover,  the  documentation 
on  actual  measurements  is  poor  in  the  sense  that  the  method  of  obtaining 
these  data  is  rarely  given  in  detail. 

In  the  next  two  chapters,  a  description  of  CP/67,  the  description 
of  the  statistical  (empirical)  evaluation  model  and  the  method  of  obtain¬ 


ing  data  is  given. 


, 
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CHAPTER  III 


A  DESCRIPTION  OF  THE  CP/67  TIME-SHARING  SYSTEM 

The  description  of  the  time-sharing  system  -  CP/67,  (Control 
Program/67)s  is  given  here. 

Sec.  3.1.  CP/67  ~  A  General  Description 

CP/67  is  a  control  program  designed  for  execution  on  an  IBM 
System/360,  Model  67.  Its  objective  is  to  create  an  environment  in 
which  each  user  believes  that  he  has  the  complete  resources  of  a 
System/360  Model  65  and  can  perform  his  own  work  under  the  supervision 
of  the  programming  system  of  his  choice.  It  achieves  its  objective  by 
generating  a  'Virtual  computer"  for  each  user  and  by  sharing  the 
resources  of  the  real  computer  (e.g.  CPU  time,  main  storage)  among  the 
virtual  computers  for  all  users  that  are  concurrently  keyed  into  the 
system. 


Virtual  computers  can  be  defined  as  computers  which  function  like 
real  ones  but  which  are  the  products  of  software  simulation.  When  a 
user  identifies  himself  from  a  terminal,  the  control  program  "creates" 
for  his  personal  use  a  virtual  computer  of  a  predefined  configuration. 
The  systems  administrator  defines  this  configuration  of  each  user's 
virtual  machine  before  the  system  becomes  available  to  the  users.  This 
configuration  can  be  different  for  different  users.  To  the  user,  his 
virtual  computer  appears  real  and  he  uses  it  as  if  it  were.  The  control 
program  also  provides  commands  that  parallel  the  functions  of  switches 

operator's  console.  The  user  issues  these  commands  from  his 

16 


on  an 
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terminal,  and,  thus,  the  terminal  becomes  a  pseudo-console  for  his  vir¬ 
tual  machine. 

After  the  control  program  has  created  the  virtual  computer,  the 
user  equips  it  with  the  programming  system  that  gives  him  the  desired 
functional  capabilities.  The  user — not  the  system — determines  the  fa¬ 
cilities  available  to  him. 

The  minimum  hardware  configuration  required  by  CP/67  is  given 

below. 

1.  1^03  Printer; 

2.  2067-1  or  2067-2  CPU; 

3.  Two  2311  Disk  Drives  or  231*+  Storage  Unit; 

U.  2365  Core  Storage  Unit; 

5.  25*+0  or  2501  Card  Reader; 

6.  25*+0  Card  Punch; 

7.  2702(3)  Transmission  Control  Unit; 

8.  1052  On-line  Console  Typewriter. 

The  following  optional  devices  are  also  supported: 

1.  1051/1052  Data  Communications  System; 

2.  2250  Graphic  Display  Unit; 

3.  2820/2301  Drum  Storage  Controller ( s  )/Unit ( s ) ; 
k.  2*+00  series  Magnetic  Tape  Units; 

5.  27*+l  (-1,  -2)  Communications  Terminals. 

Sec.  3.2.  CP/67  -  Time-Sharing  Environment 


CP/67  creates  the  time-sharing  environment  by: 


' 
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1.  Scheduling  and  allocating  main  storage  space,  CPU  time,  and 
I/O  devices  to  the  virtual  computers. 

2.  Handling  all  interruptions  (see  Sec.  3.6.). 

3.  Protecting  system  files,  user  programs,  and  user  data  during 
ex e cut  ion. 

The  control  program  shares  execution  time  in  the  central  processing 
unit  (CPU)  among  the  virtual  computers  on  a  demand  basis  and  on  a  sched¬ 
uled  basis.  It  schedules  and  allots  units  of  CPU  time,  in  rotation,  to 
the  virtual  computers.  When  a  particular  virtual  computer  has  used  up 
its  unit  of  time  ("time  slice"),  the  control  program  locates  the  next 
"runnable"  virtual  computer  and  passes  control  to  it  for  a  corresponding 
interval  of  time.  If  the  virtual  computer  currently  in  control  must  wait 
for  some  event  (an  I/O  operation),  the  control  program  gives  control  to 
another  virtual  computer,  which  has  demanded  CPU  time.  The  detailed 
description  of  the  module  that  carries  out  these  tasks  is  given  in 
Sec.  3.3. 


The  control  program  also  uses  a  special  technique  to  share  main 
storage  among  concurrent  users.  This  technique  is  called  "paging."  The 
objective  of  this  technique  is  to  keep  in  main  storage  only  those  portions 
of  each  user's  program  that  are  required  at  that  time.  This  necessitates 
the  segmentation  of  each  user's  program  into  manageable  units.  The  units 
used  in  this  case  are  l*096-byte  blocks  called  "pages."  By  breaking 
programs  into  pages,  main  storage  can  be  allocated  in  page  increments  and 
pages  can  be  loaded  dynamically  for  execution.  Thus,  at  execution  time, 
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main  storage  holds  only  the  active  part  of  each  user's  program.  This 
paging  technique  is  described  in  greater  detail  in  Sec.  3.1+. 

The  control  program  must  also  handle  all  the  interruptions  and 
change  of  state  in  the  CPU  necessitated  by  program  or  machine  errors , 
completion  of  I/O  operations  and  other  conditions.  Interrupt  handling 
by  CP/67  is  described  in  Sec.  3.5. 

Sec.  3.3.  CP/67  ~  Dispatcher 

The  allotting  of  CPU  time  to  different  users  and  scheduling  user- 
program  execution  is  handled  by  the  control  program  routine  called 
DISPATCH.  When  all  interruption -handling  routines  complete  their  pro¬ 
cessing,  they  transfer  control  to  DISPATCH.  This,  then,  is  the  central 
routine  of  the  system  CP/67.  In  general  it  performs  two  functions ; 
first,  it  charges  time  used  within  the  control  program  to  the  appropriate 
user,  and  second,  it  determines  which  user  is  to  receive  control  next. 

DISPATCH  initially  receives  control  after  the  system  generation 
on  each  day  and  control  program  initialization.  It  remains  idle  until 
an  interrupt  occurs  (i.e.  a  user  signs  on).  The  appropriate  interrupt 
handler  will  log  the  user  on  and  return  control  to  DISPATCH.  Logging 
a  user  on  includes  the  following: 

1.  Allocating  and  initializing  the  primary  user  control  table 
(UTABLE) ; 

2.  Checking  user  ID  and  password; 

3.  Allocating  tables  and  reserving  direct  access  storage  space 


for  paging;  and. 


' 
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U.  Creating  virtual  I/O  blocks  to  describe  the  user’s  virtual 
machine  and  chaining  virtual  device  blocks  to  real  device 
blocks . 

There  is  one  UTABLE  for  each  user  in  the  system.  It  is  the 
primary  control  block  from  which  all  user  blocks  are  strung.  It  com¬ 
pletely  reflects,  along  with  the  virtual  I/O  blocks,  the  status  of  the 
corresponding  virtual  machine.  A  description  of  the  UTABLE  is  given 
in  Appendix  A. 

Each  time  DISPATCH  is  entered,  the  time  used  by  the  current 
(interrupted)  user  within  the  control  program  is  computed  and  added  to 
the  TIMEUSED  entry  in  the  user's  UTABLE.  If  the  current  user  has  not 
exhausted  his  allotted  time  for  this  quantum  (time  slice),  he  will  be 
eligible  for  restart.  In  this  case  his  CPRQUEST 's  are  honoured,  his 
pending  interrupts  are  reflected  and  then  if  not  in  "wait"  state  (i.e. 
runnable)  he  is  restarted.  If  no  time  remains  for  the  interrupted  user, 
the  next  runnable  user  is  selected. 

DISPATCH,  upon  each  entry  to  it,  and  prior  to  the  running  of  any 
user,  checks  the  queues  of  control  program  execution  requests  (CPRQUEST) 
for  any  pending  work.  If  any  requests  are  found,  the  appropriate 
execution  request  block  (CPEXBLOK)  is  used  to  load  the  registers  and 
dispatch  control  to  a  specified  section  of  the  control  program.  This 
section,  on  completion,  returns  control  to  DISPATCH.  If  the  current 
user  is  not  runnable  and  if  all  CPRQUEST  stacks  are  empty,  then  a  new 
user  is  selected  to  run. 


In  order  to  prevent  paging  overload,  the  system  allows  only  a  subset 


1.  >c. 

. 


21 


of  the  users  to  run  at  any  given  time.  Interactive  users  (needing  little 
compute  time)  are  in  and  the  users  who  put  a  heavy  load  on  the  system 
in  terms  of  CPU  cycles  required  or  amount  of  nonterminal  I/O  done,  are  in 
Q2<  There  is  a  maximum  limit  on  both  and  Q^,  which  is  set  by  system 
administrators  and  partially  dependent  upon  the  real  core  size  of  the  com¬ 
puter  . 

A  user  is  in  one  of  the  following  five  states  at  any 

time : 

1.  In  Q  ;  -  State  A 

2.  Waiting  to  get  into  Q^;  -  State  B 

3.  In  Q0;  -  State  C 

4.  Waiting  to  get  into  Q2  -  State  D;  and 

5.  Dormant,  not  requiring  system  resources;  -  State  E. 

In  addition,  a  user  may  or  may  not  be  runnable,  regardless  of 
whether  he  is  in  the  queues.  A  user  is  not  runnable  if  he  is  waiting 
for : 

1.  A  page  to  be  brought  in; 

2.  An  I/O  operation;  and, 

3.  A  CP  console  function. 

In  order  to  select  a  new  user,  the  first  preference  is  given  to 
a  runnable  user  in  Q_^.  If  no  such  user  is  found  and  Q,-^  is  not  xull  and 
a  runnable  user  is  waiting  to  get  into  Q.^,  then  that  user  is  selected.  Ii 
no  such  user  is  found,  then  a  runnable  user  will  be  selected  from  Q2*  If 
none  exists,  and  Q0  is  not  full,  a  runnable  user  waiting  to  get  into  Q2 
will  be  selected.  If  still  no  user  is  found,  then  CP  goes  into  WAIT  state. 
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To  start  (or  restart)  a  user,  DISPATCH  loads  the  appropriate 
control  registers  from  the  contents  of  the  chosen  user's  UTABLE  entries, 
loads  the  interval  timer  with  the  user's  quantum  (0.05  seconds),  or 
the  unused  portion  of  it,  and  gives  control  to  the  user  by  entering  the 
problem  mode. 

The  management  of  and  is  as  follows :  A  user  enters  State  A 
(in  Q^)  only  from  State  B  (waiting  to  get  into  Q  ),  and  whenever  Q  is 
not  full.  A  user  enters  State  B  when  he  has  a  read  operation  on  his 
terminal.  When  in  Q  ,  users  are  allowed  to  use  up  to  0.4  seconds  of 
accumulated  CPU  time.  One  condition  to  remain  in  is  never  to  use  a 
full  time  slice  (quantum)  or  0.05  seconds  at  any  one  time.  If  a  user 
uses  a  full  time  slice  without  any  console  function,  or  uses  0.4  seconds 
in  bursts  of  less  than  0.05  seconds,  he  enters  State  D  (waiting  to  get 
into  Q  ).  A  enters  State  C  (in  Q^)  from  State  D  when  Q0  is  not  full. 

When  in  Q  ,  users  are  allowed  to  use  5  seconds  of  CPU  time,  again  in 
bursts  of  0.05  seconds.  A  user  enters  State  D  again  if  he  uses  a  full 
time  slice  in  one  burst,  or  uses  an  accumulated  5  seconds  in  bursts  of 
less  than  0.05  seconds.  When  there  are  many  users  in  State  D,  the 
selection  to  enter  State  C  is  according  to  the  number  of  time  slices  used 
by  different  users.  A  user,  who  has  used  the  least  number  of  time  slices, 
is  selected  since  he  has  the  highest  priority.  (All  times  given  in  this 
paragraph  are  adjustable  system  constants  and  are  the  values  used  at 
The  University  of  Alberta. ) 

Gee.  3.4.  CP/67  -  Paging 

The  paging  technique  employed  by  CP  to  share  the  main  storage  is 


discussed  here. 
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When  a  user  starts  his  session,  the  control  program  places  the  first 
page  of  the  user’s  programming  system  into  main  storage.  The  page  is 
loaded  into  an  available  block  of  main  storage  that  starts  on  a  page 
boundary.  The  page  is  not  necessarily  loaded  at  the  same  relative  main 
storage  position  as  it  would  occupy  were  the  programming  system  running 
on  a  real  computer. 

As  the  user's  program  is  executed,  the  hardware  dynamically  con¬ 
verts  references  to  relative  (virtual)  addresses  into  actual  main  storage 
addresses.  When  the  program  refers  to  an  address  in  a  page  that  is  not 
in  main  storage,  an  interruption  occurs  and  the  control  program  loads 
the  required  page  into  main  storage.  While  this  is  being  done,  the 
current  user  (say  user  A) is  put  in" PG WAIT"  and  another  user  is  selected 
to  run.  When  user  A  is  selected  to  run  at  a  later  instant,  execution 
continues  with  the  referenced  addresses  being  dynamically  relocated. 

Because  of  this  dynamic  address  relocation  feature,  the  pages  of 
a  user's  program  need  not  occupy  contiguous  locations  and  may  be  scattered 
throughout  main  storage.  Also,  because  of  the  high  demand  for  main 
storage  in  a  multiple-user  environment,  the  control  program  shares  main 
storage  among  the  active  pages  of  the  programming  systems  of  competing 
users . 


Finally,  when  main  storage  is  completely  filled,  and  it  becomes 
necessary  to  bring  in  another  page,  page  swapping  occurs.  An  appropriate 
page  of  a  user's  program  in  main  storage  is  transferred  to  secondary 
storage  (e.g.  disk  or  drum)  and  the  required  page  is  brought  into  main 
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storage  in  its  place.  If  a  page  to  be  replaced  has  previously  been 
swapped,  and  has  not  been  modified,  since  it  was  last  swapped,  it  is  not 
necessary  to  write  it  onto  secondary  storage  because  a  copy  already  exists 
there.  When  the  particular  page  that  was  replaced  is  required,  it  is 
obtained  from  secondary  storage  and  swapped  with  one  that  is  in  main 
storage. 

Sec.  3‘5‘  CP/67  -  Interrupt  Handling 

Before  proceeding  with  the  interruption-handling  by  CP,  it  is 
necessary  to  define  the  three  possible  states  of  the  real  computer.  When 
instructions  in  the  control  program  are  being  executed,  the  real  computer 
is  in  the  supervisor  state;  at  all  other  times  it  is  in  the  problem  state, 
unless,  of  course,  no  user  is  runnable;  in  this  case  the  computer  is 
in  "idle"  state. 

The  interruption  handling  routines  determine  the  cause  of  the 
interrupt  by  checking  the  machine  state.  The  reason  is  to  charge  time 
to  a  user  for  the  particular  interrupt,  if  the  machine  (real  computer)  is 
found  to  be  in  problem  state.  If  the  machine  is  found  to  be  in  supervisor 
state,  then  no  user  is  charged  for  interrupt  handling.  An  example  of  the 
latter  is  the  hardware  timer  interrupt  every  0.05  seconds. 

There  are  four  major  types  of  interruptions  handled  by  CP. 

1.  Supervisor  Call  (SVC)  interrupts; 

2.  External  interrupts  -  These  include  timer  interruption; 

Program  interruptions  -  These  are  caused  by  paging  interrupts 
and  privileged  instructions; 


3. 
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I/O  interruptions  -  These  are  caused  by  I/O  requested  by  a 
user  or  by  CP  itself. 

In  the  next  section,  the  description  of  the  CP/67  system,  as  it 
exists  at  The  University  of  Alberta  Computing  Centre,  is  given. 

Sec.  3.6.  CP/67  -  The  University  of  Alberta  Configuration  (June,  1969) 

The  CP/67  system  at  this  campus  operates  for  four  hours  a  day 
on  the  IBM  Series  360,  model  67  computer.  A  diagram  representing  360/67 
is  given  in  Fig.  3.1. 

The  two  main  programming  systems  available,  and  used,  are  CMS 
(Cambridge  Monitor  System)  and  APL  (A  Programming  Language).  The 
operating  system,  0S/360  has  been  tried  under  CP. 

CMS  is  the  monitor  that  creates  the  conversational  part  of  the 
time-sharing  environment.  CMS  could  be  used  without  CP/67  to  create  a 
conversational  system  without  time-sharing  capabilities.  When  jointly 
used,  the  monitor  executes  under  the  supervision  of  the  control  program. 
360 /Assembler,  and  FORTRAN  IV  are  available  through  CMS. 

APL  is  implemented  in  a  very  special  way.  One  virtual  machine 

l 

defined  as  a  System/360  Model  65  is  equipped  with  the  Disk  Operating 
System  (DOS)  which  supports  the  APL/36O  time-sharing  system.  Thus,  one 
time-sharing  is  supported  by  another  time-sharing  system.  Usually 
about  half  of  the  terminal  users  are  using  the  APL  virtual  machine. 

V  ,  .  vy 

APL  machine  has  the  special  scheduling  priority  so  that  it  remains  in 
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Two  other  points  to  remember  are : 

1.  The  maximum  lengths  of  and  are  fixed  at  9  and  6  user 
requests,  respectively. 

2.  Paging  is  carried  out  from  2314  Disk  Storage  Device. 

To  analyze  this  system,  a  statistical  evaluation  model  was 
constructed.  Care  was  taken  to  avoid  using  extreme  amounts  of  processing 
time  and  available  core  while  gathering  data.  This  resulted  in  careful 
choosing  of  performance  criteria  for  the  system.  This  particular 
model  is  described  next. 


CHAPTER  IV 


EVALUATION  MODELS 

The  considerations,  that  need  to  be  accounted  for,  in  the  statis¬ 
tical  evaluation  of  any  system,  have  been  mentioned  previously  in  Sec. 

1.3.  and  also  at  the  end  of  the  last  chapter.  The  most  cumbersome  problem 
to  consider  is  that  of  data  reduction.  The  stress,  again,  is  put  on 
obtaining  maximum  information  from  minimum  amount  of  data  without  making 
too  many  restricting  assumptions. 

Sec.  U.l.  The  Three  Models 

The  three  models  used  to  evaluate  CP/67  statistically  are  the 
following : 

1.  System  evaluation — this  requires  data  on  system  parameters; 
for  example,  the  time  spent  by  the  system  in  CP  or  supervisory 
mode . 

2.  User-states  evaluation — this  requires  data  on  user-parameters 

such  as  the  number  of  users  in  Q,  and  Q„. 

1  c 

3.  APL  evaluation — this  requires  data  on  the  user  APL  only 
without  considering  other  users  (see  Fig.  3.2). 

These  three  models  are  discussed  in  the  next  three  sections.  The 
methods,  for  statistical  evaluation,  are  discussed  in  general  terms  in 
Sec.  U.5.,  and,  the  method  used  is  described  in  more  detail  in  Sec.  U.6. 
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Sec,  h.2.  System  Parameters 

The  control  program  accumulates  certain  statistics  for  the  time- 
period  during  which  the  CP/67  system  is  operating  on  a  particular  day. 
The  parameters  for  the  evaluation  of  system  performance  were  selected 
from  these,  and  are  listed  below: 


1.  NUMUSERS:  A  half  word  (2  bytes)  in  memory  is  occupied  by  NUMUSERS. 

It  is  the  number  of  users  logged  on  the  system  at  any  in¬ 
stant.  This,  of  course,  represents  the' real  external  load 
(i.e.  number  of  virtual  machines)  on  the  system. 

2.  CPTIME  :  A  full  word  bytes)  in  memory  is  occupied  by  CPTIME. 

CPTIME  is  the  time  spent  by  the  system  in  CP  mode;  i.e. 
time  spent  by  the  system  executing  control  program  instruc¬ 
tions  in  the  supervisory  state. 

3.  OVERHEAD:  OVERHEAD  occupies  a  full  word  in  memory.  OVERHEAD  is  the 

time  spent  (in  DISPATCH)  to  find  a  new  user  to  run  after 
it  has  been  determined  that  the  current  user  is  not 
runnable.  This  value,  hence,  is  included  in  CPTIME  as 
illustrated  in  Fig.  l+.l. 


k.  WAITTIME:  This  occupies  a  full  word  in.  memory.  WAITTIME  is  the  time 

spent  by  the  system  when  DISPATCH  does  not  find  a  single 
runnable  user.  DISPATCH  remains  idle  until  an  interrupt 
occurs  (usually  caused  by  change  in  some  user's  state). 

5.  PGREAD  :  PGREAD  is  the  total  number  of  pages  read  in  since  system 


initialization  on  a  particular  day. 


. 
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6.  PGSWAP  :  This  is  the  total  number  of  pages  swapped  out  since  system 


initialization  on  a  particular  day. 


The  parameters  CPTIME,  OVERHEAD,  and  WAITTIME  indicate  the  fraction 
of  the  time  the  system  spends  in  different  modes.  The  time  spent  by  the 
system  in  problem  mode  (this  is  not  accumulated  by  the  system)  can  be 


DISPATCH 
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« -  CPTIME 

k  - 

IOINT 

handles  interrupt 


IK 


■> 

IK 


OVERHEAD 


Interrupt  occurs  Current  user  not  runnable  Next  user  found 

(say,  I/O  interrupt)  to  run 


Fig.  1.1.  Format  of  CPTIME 

determined  by  the  following  relation: 

PROBTIME  =  TOTAL  TIME  -  (CPTIME  +  WAITTIME). 

where  PROBTIME  is  the  time  spent  in  problem  mode  and  TOTALTIME  is  the 
time  from  system  initialization  to  the  time  of  observation. 

The  parameters  OVERHEAD  and  CPTIME  also  indicate  the  efficiency 
of  the  main  control  routine  DISPATCH.  They  are,  in  a  way,  measures  of 
the  scheduling  policy.  Too  much  time  spent  in  the  OVERHEAD  part  of  CP 
mode  would  imply  the  ineffectiveness  of  the  scheduling  policy;  although 
the  blame  should  not  fall  on  the  scheduling  algorithm  entirely.  Users 
can  also  degrade  the  performance  of  DISPATCH.  The  larger  the  number  of 
"not-runnable"  users,  the  more  OVERHEAD  and  CPTIME  accumulate.  This 
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happens  because  DISPATCH,  in  trying  to  find  the  first  "runnable"  user, 
has  to  check  all  the  "not-runnable"  users  in  turn,  thus  taking  more  time. 
Similarly,  users  performing  "I/O  bound"  jobs,  or  "non-compute"  bound  jobs 
would  necessarily  require  more  service  from  the  CP  supervisor. 

The  last  two  parameters  occupy  a  full  word  each,  in  memory.  To¬ 
gether,  they  indicate  the  paging  load  on  the  system.  Since  paging  is 
done  from  disk,  they  also  indicate  the  part  of  the  load  on  the  disk 
storage  device,  and  the  channel  activity  associated  with  the  information 
transfer  to  and  from  this  device. 

The  only  other  system  parameter  considered  is  the  "real  time  of 
day"  to  determine  the  time  observations  are  taken,  and  the  interval 
between  successive  observations.  This  value  is  accumulated  in  "hours- 
minutes-seconds"  only. 

Sec.  ^.3.  Virtual  Machine  Parameters 

The  second  evaluation  model  determines  user-states  through  the 
following  two  entries  in  each  user's  UTABLE. 

(i)  TIMINQ;  (ii)  VMSTATUS. 

TIMINQ  is  the  accumulation  of  time  spent  by  the  user  in  Q^  or  Q  . 
This  is  reset  to  zero,  everytime  he  leaves  Q^  or  Q^.  The  following 
information  can  be  obtained  from  the  word  TIMINQ: 

If  TIMINQ  =  0,  then  the  user  is  neither  in  Q  nor  in  Q2« 

If  TIMINQ  i-  0,  then  the  user  is  either  in  Q  or  in  Q^. 

One  important  point  to  remember  is  that  only  the  first  3  bytes  of  TIMINQ 

represent  the  value  of  the  time  spent  in  Q^  or  Q^.  The  last  byte  of  the 
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word  is  labelled  NOQUANT  and  can  contain  any  value  from  0  to  127.  NOQUANT 
is  the  count  of  the  number  of  time  slices  used  by  the  user,  while  in  Q  . 

In  addition,  the  first  bit  of  NOQUANT  is  labelled  CONQBIT  and  is  used  to 

decide  which  queue  the  user  is  in  (see  Fig.  h.2). 

If  CONQBIT  =  0,  then  the  user  is  a  member  of,  or  eligible  to  enter  Q  . 

If  CONQBIT  =  1,  then  the  user  is  a  member  of,  or  eligible  to  enter  Q^. 

The  format  of  TIMINQ  is  illustrated  in  Fig.  h.2. 

NOQUANT 

i  i  I  I  I  I  I  I  ' 

The  value  of! time  spent ! in  Q 

_ _  _ _ i _ _ _ i _ 1 _ _ _ _ _ _ 111., 

0  12  3*+ 

CONQBIT 

Fig.  h.2.  TIMINQ 

From  now  on,  reference  to  TIMINQ  would  imply  just  the  first  three  bytes 
of  the  word. 

Combining  TIMINQ  and  CONQBIT,  the  following  deductions  can  be 

made : 

If  TIMINQ  ^  0  and  CONQBIT  =  1,  then  the  user  is  in  Q  . 

If  TIMINQ  ^  0  and  CONQBIT  =  0,  then  the  user  is  in  Q  . 

If  TIMINQ  =  0  and  CONQBIT  =  1,  then  the  user  is  eligible  to  enter  Q1  or 

the  user  is  in  console  function  wait  (defined  below). 

Finally , 

if  TIMINQ  =  0  and  CONQBIT  =  0,  then  the  user  is  eligible  to  enter  Q0. 
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To  determine  the  number  of  users  eligible  for  and  to  investigate 
user-states  in  more  detail,  the  following  parameter  is  necessary. 


VMSTATUS  indicates  the  state  of  the  user's  machine.  There  are 
eight  possible  states  that  can  be  tested  but  only  the  following  four 
were  selected  (see  Appendix  A,  UTABLE,  for  description  of  the  other 
four  states): 


PGWT 


IOWT 


Indicates  whether  or  not  the  user  is  waiting  for  a  page  to 
be  brought  in  from  secondary  storage  (DISK)  to  main 
storage. 

Indicates  whether  or  not  the  user  is  waiting  for  an  I/O 


operation  to  be  completed. 


CFWT 


Indicates  whether  or  not  the  user  is  waiting  to  perform 
a  console  function.  Obtaining  this  information  helps 
in  deciding  whether  the  user  is  eligible  for  Q  or  waiting 
to  carry  out  a  console  function. 


EXCFN 


Indicates  whether  the  user  is  actually  executing  a 


console  function. 


These  four  states  are  represented  in  the  first  byte  of  VMSTATUS 
by  a  "l"  in  first,  second,  third,  or  fourth  bit,  respectively. 


0 

0 

0 
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Fig.  U.3.  VMSTATUS 
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VMSTATUS  value  shown  in  Fig.  U.3  implies  that  the  corresponding  virtual 
machine  is  in  IOWT. 


.  Kence  TIMINQ  and  VMSTATUS  determine  users’  states  adequately 
(see  Sec.  3. 3). 


For  example,  a  user  can  have  in  his  UTABLE, 


VMSTATUS  = 


1 

0 

0 

0 

0 

0 

0 

0 

TIMINQ  ±  0,  and  CONQBIT  =  1. 


This  would  imply  that  the  user  is  in  Q  and  waiting  for  a  page  to  be 
brought  in. 

Sec.  U.4.  APL-Users*  Parameters 

As  mentioned  before,  at  the  time  of  observation  APL  represented 
at  least  50%  of  the  user  population  using  CP  (see  Fig.  3.2).  This 
classifies  APL  as  a  predominant  user,  and  its  evaluation  deserves  special 
treatment . 

The  APL-users'  UTABLE  is  investigated  in  more  detail  to  get  a 
closer  look  at  its  performance.  The  following  parameters  are  selected 
for  this : 

1.  TIMINQ. 

2 .  VMSTATUS . 


3.  TIMEUSED:  This  occupies  a  full  word  in  the  UTABLE;  TIMEUSED  is  the 
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total  time  used  since  log  on  (problem  state  +  CP  overhead 
charged  to  APL).  This  gives  the  load  APL  puts  on  the 
CP/67  system  as  far  as  compute-time  is  concerned. 

NUMPAGES  and  PRIORIT:  These  two  together  occupy  a  full  word  in  the 
UTABLE.  NUMPAGES  is  the  number  of  pages  APL  has  in  core 
at  a  particular  time  instant.  PRIORIT  is  important  if 
APL  ever  enters  Q  .  If  it  does,  then  this  gives  the 
priority  of  APL  in  Q  . 

5.  VTOTTIME:  This  occupies  a  full  word  in  the  UTABLE.  VTOTTIME  is 

the  total  problem-state  time,  since  log  on,  used  by 
APL.  Hence,  (TIMEUSED-VTOTTIME )  gives  the  CP  overhead 
charged  to  APL. 

All  these  parameters  were  selected  for  one  basic  reason,  namely 
to  indicate  and  classify  the  load  APL  puts  on  the  CP/67  system,  the 
first  two  have  been  described  in  Sec.  4.3.  NUMPAGES  and  VTOTTIME  indi¬ 
cate  paging  and  CPU  load,  respectively,  and  TIMEUSED  minus  VTOTTIME 
gives  the  CP  overhead  charged  to  APL. 

The  number  of  users  using  APL  is  obtained  from  a  fixed  location 
in  the  memory  (NUMDIAL).  The  variation  of  the  state  of  the  APL  machine 
for  the  different  number  of  users  connected  is  important,  because  future 
prediction  of  the  load  APL  can  handle  can  be  obtained  through  this. 

Sec.  4.3.  Possible  Data  Gathering  Techniques 


The  selection  of  all  the  parameters  for  the  evaluation  models 


' 


37 


are  a  direct  result  of  the  methods  available  to  gather  data  on  CP/67. 

There  are  two  approaches  possible. 

1.  DISPATCH  has  a  subroutine  called,  DEMON,  which  is  used  mainly  for 
updating  real  time  every  second.  There  is  an  entry  to  DEMON  every  quarter 
second  on  the  average.  Hence  after  every  four  entries  to  DEMON,  the 
time  is  updated.  This  routine,  then,  can  be  extended  to  gather  statistics 
at  each  updating  of  real  time. 

The  two  main  disadvantages  of  this  method  are  the  following. 

First,  it  would  require  some  "patching”  or  rewriting  of  the  control  pro¬ 
gram.  A  storage  area  would  have  to  be  kept  free  to  accumulate  different 
statistics.  This  would  encroach  on  the  free  storage  available  to  the 
control  program.  Moreover,  "playing  around"  with  the  software  of  a 
system  is  always  dangerous.  The  chances  of  bad  linkages  in  inserting 
a  "stat-gathering"  program,  are  very  high.  In  short,  this  would  put 
unnecessary  pressure  on  the  control  program. 

Second,  the  gathering  of  user-data  (2nd  and  3rd  part  of  the  model)  seems 
very  difficult.  UTABLE's  are  not  in  fixed  location  in  the  memory,  but 
are  "floating."  They  are  linked  together,  but  not  necessarily  in  a 
contiguous  memory  block.  The  approach,  through  DEMON  ,  then  would  make 
the  system  spent  too  much  time  in  CP-mode  trying  to  locate  the  required 
data. 


One  advantage  of  this  method  is  that  the  time  interval  between 
successive  readings  can  be  made  very  small  by  activating  DEMON  more  often 


. 
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(say,  everytime  DISPATCH  is  entered).  This  again  would  result  in  a  deg¬ 
radation  of  "user-response  time"  due  to  excessive  time  spent  in  CP-mode. 

This  method,  then,  was  discarded  for  the  above  reasons  and  the 
following  approach  was  used: 

2.  The  other  method  available  for  such  a  statistical  evaluation  model 
uses  a  virtual  machine  (and  hence  is  a  "user"  of  CP/67)  to  look  at  the 
specified  parameters  periodically.  The  need  for  any  "patching"  of  the 
control  program  becomes  nonexistent.  The  load  added  to  the  system  can  be 
considered  as  just  another  user  (virtual  machine)  using  the  system. 

Through  the  use  of  the  DIAG  (Diagnose)  instruction,  all  the  nec¬ 
essary  data  can  be  obtained.  The  description  of  this  method,  and  the 
program  used  to  construct  the  evaluation  model  is  given  next. 

Sec.  4.6.  The  Data  Gathering  Technique  Used 

The  description  of  the  DIAG  instruction  is  given  first  in  order 
to  discuss  the  method  more  meaningfully. 

DIAG  is  a  privileged  instruction  (code  ’83' )•  To  use  it,  a  user 
requires  the  priority  class  A  or  C  in  the  CP/67  system,  where 
class  A  -  corresponds  to  the  systems  operator; 

class  B  -  corresponds  to  the  systems  administrator; 

class  C  -  corresponds  to  the  subordinate  systems  operator;  and 

class  D  -  corresponds  to  the  system  user. 

This  instruction  is  used  for  diagnostic  purposes  usually  and  hence, 
has  the  ability  to  look  at  specified  memory  locations.  This  ability  is 
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used  to  gather  statistics.  The  instruction  is  used  in  the  following 
" 360/As sembler"  sequence  of  instructions: 

LA  1,  LIST 

LA  2,  NOLIST 

LA  3,  STL I ST 

DIAG  1 ,  3 

The  address  of  the  list  of  memory  locations  to  look  at  (LIST)  is  loaded 
in  Register  1.  The  number  of  entries  in  this  list  (NOLIST)  is  loaded 
into  Register  2  and  the  address  of  the  storage  area  (STLIST)  where  the 
values  from  the  specified  memory  locations,  are  to  be  stored,  is  loaded 
in  Register  3.  Then  DIAG  instruction  obtains  these  values  and  stores 
them  in  the  specified  storage  area. 

The  virtual  machine  with  the  identification  name  CPSTATS  was 
used  for  this  model.  The  priority  class  C  was  assigned  to  it. 

A  program  CPSIiOOP  (listed  in  Appendix  B)  was  written  in  360/ 
Assembler  and  used  some  of  the  CMS  macros  available.  The  instruction 
DIAG  was  used  throughout  the  program  to  gather  statistics.  Algorithm 
U.l  and  the  flowchart  in  Fig.  U.U  describe  CPSN00P  in  detail  indicating 
the  method  used  to  perform  the  required  tasks  for  the  first  two  models. 

Algorithm:  U.l. 

Step  1  Advance  the  buffer  pointer  32  bytes  to  allow  space  for  the  infor¬ 
mation  to  be  brought  in.  If  the  buffer  area  (800  bytes)  will 
not  handle  the  additional  32  bytes,  write  the  information  already 
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in  the  buffer  and  set  the  pointer  to  32  before  going  to  step  2. 

Step  2  Use  DIAG  instruction  to  pick  up  the  system  parameters  listed  in 
Sec.  4.2  and  store  the  values  in  the  buffer  area.  Go  to  step  3. 

Step  3  Use  RUNUSER  (UTABLE  link)  to  load  the  current  user’s  (in  this 

case  stat-gatherer — CPSTATS)  UTABLE  and  again  use  DIAG  instruction 
to  pick  up  the  pointer  to  NEXTUSER  in  the  UTABLE  chain.  Go  to 
step  4. 

Step  4  Load  next  user’s  UTABLE,  pick  up  user  parameters  as  listed  in 

Sec.  4.3  and  also  the  "link"  to  the  next  user.  Move  pointer  in 
the  buffer  to  8  bytes  further  and  test  the  "buffer- full"  condi¬ 
tion  as  in  step  1.  If  buffer  can  handle  8  bytes  more,  store 
information  in  buffer,  otherwise,  write  the  information  in  the 
buffer  on  disk  and  continue  after  resetting  the  pointer  in  the 
buffer  to  8  bytes.  Go  to  step  5- 

Step  5  Test  if  the  next  user  is  the  RUNUSER  (CPSTATS);  if  it  is,  then 
go  to  step  6;  if  it  is  not  then  go  to  step  4. 

Step  6  Perform  some  console  function  (fixed)  and  then  go  to  step  1. 

The  step  that  ends  this  program  is  not  included  because  it  depends 
on  the  size  of  the  sample  required. 

The  purpose  of  performing  the  console  function  in  step  6  is  to 
keep  the  stat-gathering  virtual  machine  (CPSTATS)  in  the  high  priority 
queue,  ,  and  also  to  control  the  time  interval  between  successive 

The  virtual  machine  waits  for  this  console  function  (in  this 
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case,  3  sequences  of  "space-backspace”),  and  then  picks  up  system  param¬ 
eters  again.  This  "wait"  results  in  readings  taken  every  second. 

A  buffer  area  of  800  bytes  is  also  created  to  store  the  information 
gathered,  sequentially.  When  this  area  gets  full,  the  information  is 
written  out  on  disk  before  proceeding  any  further.  A  pointer  is  moved 
in  the  buffer  as  information  is  stored  so  that  no  information  is  lost 
through  overlapping. 

After  running  the  program  for  a  specified  time  (say,  20  minutes), 
another  program  ANALYSE  is  called.  ANALYSE  converts  the  data  from 
binary  to  decimal,  tests  TIMINQ  and  VMSTATUS  readings  to  decide  the 
user-states,  and  makes  up  counts  of  the  number  of  users  in  different 
states.  This  information,  then,  is  punched  out  on  cards.  For  each  set 
of  readings  (i.e.  for  each  interval  of  1  second),  one  card  is  punched. 
Table  4.1  gives  the  format  of  information  punched  out  on  cards  by 
ANALYSE. 

The  third  model  which  is  used  to  evaluate  APL,  needs  a  separate 
program  APLDATA.  This  program  is  very  similar  to  CPSN00P  and  is  described 
in  Algorithm  4.2.  and  the  flowchart  Fig.  4.5.  A  program  ANAPL  was 
written  (quite  similar  to  ANALYSE)  to  convert  these  data  into  decimal 
and  punch  it  out  on  cards. 

Algorithm:  4.2. 

Step  1  Advance  the  buffer  pointer  12  bytes  to  allow  space  for  the 

information  to  be  brought  in.  If  the  buffer  area  (800  bytes) 
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TABLE  U.l 


FORMAT  OF  INFORMATION  PUNCHED  OUT  ON 

CARDS  BY  ANALYSE 

Card  Columns 

Variable 

1-  6 

Time  of  day  in  seconds 

7-  9 

Number  of  users 

10-16 

CPTIME  in  seconds 

17-23 

WAITTIME  in  seconds 

2U-30 

OVERHEAD  in  seconds 

31-37 

Number  of  pages  read  in 

38-41+ 

Number  of  pages  swapped  out 

45—47 

Number  of  users  in  page  wait 

U8-50 

Number  of  users  in  I/O  wait 

51-53 

Number  of  users  in  Console  function  wait 

54-56 

Number  of  users  executing  Console  function 

57-59 

Number  of  users  in 

60-62 

Number  of  users  in 

6  3-65 

Number  of  users  eligible  to  enter 

66-68 

Number  of  users  eligible  to  enter 
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"will  not  handle  the  additional.  12  bytes  ,  write  the  information 
already  in  the  buffer  and  set  the  pointer  to  12  before  going  to 
step  2. 

Step  2  Use  DIAG  instruction  to  pick  up  RUNUSER,  time  of  day,  and  number 
of  users  using  APL  (NUMDIAL) ,  and  store  it  in  the  buffer  area. 

Go  to  step  3. 

Step  3  Get  the  address  from  RUNUSER,  load  CPSTAT's  UTABLE  and  use  DIAG 
to  pick  up  the  pointer  to  next  user.  Go  to  step  4. 

Step  4  Load  next  user's  UTABLE  and  pick  up  USERID  (user  identification) 
and  pointer  to  next  user.  Go  to  step  5- 

Step  5  Test  this  USERID  to  see  if  it  is  APL;  if  it  is  then  go  to  step  6 
if  it  is  not,  go  to  step  4. 

Step  6  From  APL's  UTABLE  select  the  parameters  listed  in  Sec.  4.4.  Set 
the  pointer  20  bytes  further  and  test  if  the  buffer  can  handle 
these  20  more  bytes.  If  it  can,  store  this  information  from 
UTABLE  and  go  to  step  J.  If  it  cannot,  then,  write  the  infor¬ 
mation  in  the  buffer  on  disk,  set  pointer  to  20  and  go  to  step  7 

Step  7  Perform  some  fixed  console  function  and  then  go  to  step  1. 

The  programs  CPSNOOP,  ANALYSE,  APLDATA,  and  ANAPL  were  used  for 

a  one-month  period  from  the  end  of  June  to  the  end  of  July,  1969.  The 

CP/67  system  ran  from  3  p.m.  to  7  p.m.  every  day.  An  average  user  popu¬ 
lation  of  about  8  to  10  users  existed  during  most  of  these  four  hours. 

The  high  numbers  of  users  were  logged-on  between  3.30  p.m.  and  4.30  p.m. 

usually.  A  low  value  of  number  of  users  was  found  between  6  p.m.  and 
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7  p.m.  with  the  lowest  value  of  3  users  obtained  only  after  6.30  p.m. 

The  next  chapter  describes  the  physical  restrictions  on  the  models  and 
the  data  obtained  using  these  models.  These  data  are  analyzed  in  detail. 


and  conclusions  are  drawn. 


CHAPTER  V 


ANALYSIS  OF  RESULTS 

The  analysis  of  the  data  collected  for  a  period  of  one  month  is 
given  in  this  chapter.  The  samples  consisted  of  1200  observations, 
which  were  taken  at  one-second  intervals  for  twenty  minutes.  This 
sample  size  proved  to  be  convenient  as  far  as  restricting  the  mass  of 
data  and  keeping  the  number  of  users  nearly  constant  in  each  sample. 

The  amount  of  data  had  a  physical  restriction  in  that  only  six  cylinders 
of  disk  storage  space  were  available.  This  space  was  used  to  store  the 
data  before  they  were  punched  onto  cards.  During  the  month  of  data- 
gathering,  the  CP/ 67  system  ran  for  four  hours  every  day.  On  the 
average,  eight  users  (virtual  machines)  were  logged-on  for  two  to  three 
hours.  To  obtain  observations  for  other  values  of  the  number  of  users, 
"twenty-minute"  samples  were  found  to  be  effective.  A  low  number  of 
users  usually  existed  only  for  the  last  twenty  minutes  of  the  time  the 
CP/67  system  was  "on."  The  change  in  the  number  of  users  of  the  APL 
virtual  machine  was  very  rapid.  Hence,  to  obtain  a  sample  with  a 
constant  number  of  APL  users,  the  time  interval  for  sample  observations 
was  set  to  ten  minutes.  The  analysis  of  the  APL  observations  is  given 
in  Sec.  5-6. 


Sec.  5.1  describes  the  transformations  applied  to  each  of  the 
variables  used  to  estimate  the  system  and  user  parameters.  The 
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randomness  of  the  samples  is  discussed  in  Sec.  5 >2,  while  the  actual 
analysis  of  the  data  is  described  in  Sections  5*3>  5.U,  and  5*5* 

Sec.  5.1.  Transformations  of  Variables 

Considering  the  first  and  second  model,  the  values  of  the 
system  variables  (see  Sec.  h .2)  -  Time  of  day;  NUMUSERS,  CPTIME, 
WAITTIME,  OVERHEAD,  PGREAD,  PGSWAP  -  and  the  user  variables  (see  Sec. 
U.3)  -  TIMINGQ  and  VMSTATUS  -  were  obtained  at  one-second  intervals. 
The  logical  tests  on  TIMINQ  and  VMSTATUS,  described  in  Sec.  1+.3, 
yielded  the  following  eight  variables : 


1. 

Number 

of 

us  ers 

in  PGWT; 

2. 

Number 

of 

users 

in  IOWT; 

3. 

Number 

of 

users 

in  CFWT; 

4. 

Number 

of 

users 

executing 

console  function  (EXCFN); 

5. 

Number 

of 

users 

in  Qx; 

6 . 

Number 

of 

users 

in  Q2; 

7. 

Number 

of 

users 

eligible 

for 

h 

(EQ1 ) ; 

8. 

Number 

of 

users 

eligible 

for 

Q2 

(EQ2). 

These  logical  tests  are  part  of  the  program  ANALYSE,  described  in 
Sec.  U.6. 

The  following  transformations  of  variables  were  performed 
before  the  data  on  punched  cards  were  written  onto  tape.  Time  of  day 
was  checked  first  to  see  if  the  readings  were  taken  at  one-second 
intervals.  It  was  found  that  after  every  n  seconds,  where  n  was 
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different  for  different  samples,  a  reading  was  skipped.  Instead  of 
one-second,  a  two-second  time  interval  between  successive  readings 
was  observed.  Whenever  the  program  CPSN00P  had  to  write  the  data  from 
the  buffer  onto  disk  storage,  "data-gathering"  stopped,  and,  thus,  one 
reading  was  skipped.  The  lengths  of  the  one-or-two-second  intervals 
were  used  in  the  rest  of  the  transformations  described  below. 

Instead  of  using  the  actual  values  of  CPTIME,  WAITTIME,  and 
OVERHEAD,  the  percentages  of  the  time  the  system  spent  in  the  respective 
modes  were  calculated.  Each  of  these  were  obtained  by  subtracting  the 
respective  value  at  the  time  instant  i  from  the  value  at  the  next  time 
instant  i+1,  dividing  this  by  the  length  of  the  time  interval  (i,i+l), 
and  then  multiplying  by  100.  The  values  of  PGREAD  and  PGSWAP  were  also 
calculated  for  the  time  interval  (i,i+l)  and  then  divided  by  the  length 
of  the  time  interval  to  obtain  the  paging  activity  per  second.  PROBTIME 
percentages  were  calculated  by  subtracting  the  (CPTIME  +  WAITTIME) 
percentages  from  100.  Thus,  the  values  of  the  transformed  variable 
PROBTIME  are  the  percentages  of  the  time  the  system  spends  in  the  problem 
mode.  Throughout  the  rest  of  this  chapter,  the  "trans formed”  variables 
will  carry  the  name  of  the  "actual"  variables. 

The  user  variables  were  transformed  as  follows:  Each  variable 
was  divided  by  NUMUSERS  and  then  multiplied  by  100  to  obtain  the 
percentage  of  the  total  number  of  users  in  a  particular  state.  During 
the  month  when  the  "data-gathering"  was  carried  out,  the  average  number 


of  users  was  about  eight. 
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Percentage  values  were  considered  for  the  following  reasons: 

Suppose  that  the  number  of  users  logged-on  to  the  system  in  a  particular 
twenty-minute  session  was  found  to  be  5  and  the  number  of  users  in 
also  was  found  to  be  5;  and  in  the  next  sample,  10  users  were  logged-on, 
and  again  5  users  were  found  in  Q  .  Then  to  deduce  that  the  number  of 
users  in  is  constant  and  equal  to  5  would  be  nonsensical.  A  percentage 
value  would  give  a  better  measure  for  comparisons.  The  actual  value 
should  be  used  if  a  maximum  value  of  the  number  of  users  in  a  particular 
state  is  to  be  predicted.  In  this  study  the  actual  values  were  not 
considered  since  the  number  of  users  was  too  low. 

The  above  described  percentage  values  of  the  fifteen  system  and 
user  variables  were  written  out  on  tape  for  each  time  instant.  They 
were  written  separately  for  each  sample  so  that  any  analysis  program 
could  be  directed  to  any  particular  sample,  and  only  to  that  sample,  if 
needed. 


Sec.  5.2.  Test  for  Randomness 

The  first  step  taken  to  analyze  the  data  was  to  test  the  sample 
observations  for  random  behaviour.  Testing  for  randomness  is  difficult 
because  there  is  no  overall  definition  of  "random."  Tests  for  certain 
criteria  of  nonrandom  behaviour  can  be  considered.  Then  the  rejection 
of  the  hypothesis  of  nonrandomness  supports  the  alternate  hypothesis  of 
random  behaviour  of  the  series  of  observations  considered.  Some  of  the 
characteristics  of  nonrandomness  of  a  series  of  observations  are: 

1.  The  presence  of  extreme  variations, 


the  presence  of  trends , 
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the  presence  of  periodic  fluctuations, 
U.  the  presence  of  discontinuities. 


Assuming  that  the  observations  are  a  random  sample  from  a  distri¬ 
bution  with  a  continuous  cumulative  density  function,  Wald  and  Wolfowitz 

(30)  (see  also,  Bennet  and  Franklin  (2),  page  687)  studied  the  distribu- 

n 

tion  of  the  random  variable  R'  =  Y  x.  x.  ..  .  For  R!  they  showed  that 

h  .  L.  1  l+h  1  J 

i=l 

mean  (RJ)  =  E(RJ)  =  (S*-S2), 

and 


Var  (Rp  =  a2, 


S?-S,  S2-1*S2S  +US  S  +S2-2S  (S2-S  )2 

2  4  1  12  132  4  12 

n-1  +  (n-1 )  (n-2 )  (n-l)2 


where  S 

K 


1 


i=l 


and  n  is  the  number  of  observations.  The  distribution  of  R'  is  shown  to 

1 

approach  normality  as  n  increases.  Hence,  the  significance  of  observed 
R '-values  can  be  tested  for  sufficiently  large  n  by  considering  T, 


R!  -  E(Rp 


1 


as  a  standardized  normal  random  variable. 


To  avoid  repetitions,  only  the  continuous  variable  CPTIME  was 
tested  for  samples  1  to  lU  and  the  values  of  T  were  tabulated  in  Table 
5.1.  The  upper  95$  critical  point  of  a  standardized  random  variable 
is  1. 61+5.  Hence  the  hypothesis  of  random  behaviour  of  the  observations 
is  accepted  for  all  the  samples. 
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TABLE  5.1 

STANDARDIZED  VALUES  OF  RANDOM  VARIABLE  R| 
AS  APPLIED  TO  CPTIME 


Sample 

T 

Sample 

T 

1 

0.166 

8 

0.072 

2 

0.203 

9 

0.795 

3 

0.113 

10 

0.233 

k 

0.278 

11 

0.092 

5 

0.162 

12 

0.231 

6 

0.06k 

13 

0.125 

7 

0.28U 

Ik 

0.081 

Sec.  5.3.  Tabulations  of  Sample  Statistics 

Two  types  of  tabulations  of  sample  statistics  (mean,  standard 
deviation,  maximum  value  and  minimum  value)  were  carried  out.  The  first 
set  of  15  tables  (see  Tables  I  in  Appendix  C)  are  the  sample  statistics 
for  each  of  the  15  variables,  for  the  l6  samples  separately.  The  second 
set  of  tables  (see  Tables  II  in  Appendix  C)  gives  the  mean  values  of 
the  variables  for  different  numbers  of  users  regardless  of  the  sample 
they  come  from.  The  only  restriction  applied  in  the  second  case  was 
that  the  mean  values  were  calculated  for  only  those  numbers  of  users  for 
which  there  were  more  than  100  observations  in  a  sample. 

Frequency  histograms  were  also  obtained  for  each  variable  and  all 
samples  to  get  a  picture  of  the  shape  of  the  distributions.  These 
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histogram-plots  are  given  in  Figures  HI  to  HlU  in  Appendix  D.  NUMUSERS 
is  the  only  variable  which  is  not  plotted  since  the  variation  associated 
with  this  variable  is  small. 

In  the  next  section,  a  systematic  discussion  of  the  implications 
of  the  above  mentioned  sample  statistics  for  each  variable  is  given. 

Sec.  5.k.  Discussion  of  Sample  Statistics 

1.  NUMUSERS  -  (see  Table  I (a)  Appendix  C). 

The  minimum  of  all  the  observed  values  of  NUMUSERS  was  3  and  the 
maximum  was  15.  Hence  there  existed  a  user  (virtual  machine)  population 
varying  from  3  to  15  users  during  the  time  of  data-gathering .  On  the 
average  there  were  8  users  logged-on  to  the  system  during  that  time. 

The  maximum  and  minimum  values  indicate  the  range  of  the  number  of  users 
in  each  sample.  For  example  in  the  sample  15,  observations  are  taken  for 
only  3  and  4  users  whereas  in  sample  8,  the  number  of  users  observed 
were  8  to  9- 

2.  CPTIME  -  (see  Table  l(b)  Appendix  C) 

An  overall  average  of  about  lQ%  was  obtained.  Thus  on  the 
average,  the  system  spent  18$  of  its  time  in  the  supervisor  or  CP-mode. 
The  high  values  of  the  standard  deviations  suggest  that  any  conclusions 
drawn  from  the  average  value  would  have  to  include  the  standard  error 
of  the  mean.  A  minimum  value  of  1%  and  a  maximum  value  of  100$  can  also 
be  read  off  this  table.  The  Fig.  HI  shows  the  observed  behaviour 
of  CPTIME.  Using  the  sample  mean  and  variance,  both  a  Poisson  and  a 
Normal  distribution  were  fitted  to  these  data,  but  were  rejected  since 
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the  Kolmogorov-Smirnov  (see  Feller  (12))  test  indicated  a  poor  fit.  The 
third  and  fourth  moments  were  also  calculated.  A  negative  value  of 
kurtosis  indicated  the  presence  of  a  smaller  number  of  large  deviations 
than  would  be  expected  if  the  distribution  were  normal. 

3.  WAITTIME  -  (see  Table  1(c)  Appendix  C) 

The  overall  observed  average  was  27$.  This  implies  that  27$  of 
the  total  time  the  central  processing  unit  was  idle.  This  is  a  very- 
high  value,  implying  inefficient  use  of  the  machine  by  the  users. 

WAITTIME  is  user  controlled  since  a  high  value  is  caused  by  users  not 
utilizing  the  CPU  facility  to  the  full  extent.  For  example,  in  sample 
9,  a  value  of  85.6$  is  obtained  for  a  user  population  varying  from  6  to 
15  users.  Although  one  would  expect  a  high  value  of  WAITTIME  for  a  low 
user  population,  this  is  not  always  so  as  Tables  l(a)  and  l(c)  indicate.  It 
may  be  more  meaningful  to  report  two  average  values  for  this  variable. 

There  is  either  a  low  value  as  in  samples  1,3,  5,  7,  8,  10,  12,  lU,  15, 
l6,  which  on  the  average  is  7.3$;  or  a  very  high  value  as  for  samples  2, 

U,  6,  9,  11,  13,  which  on  the  average  is  59-9$.  The  low  average 
WAITTIME's  were  plotted  as  a  histogram,  Fig.H2(a),  in  Appendix  D,  and 
are  all  concentrated  in  the  interval  (0,5 )$.  Fig.  H2(b)  indicates 
the  behaviour  of  the  high  average  WAITTIME's. 

k.  OVERHEAD  -  (see  Table  l(d)  Appendix  C) 

The  overall  average  was  6$  overhead  in  the  observed  time  intervals. 
The  maximum  value  of  OVERHEAD  among  all  the  samples  was  30$  which 
occurred  in  sample  11.  Looking  at  Table  I (a),  the  average  number  of 
users  logged-on  when  sample  11  was  obtained,  was  12  to  14,  which  is  one 


. 

. 


5  6 


of  the  highest  among  all  the  samples.  This  does  not  necessarily  -imply 
any  relation  between  NUMUSERS.  and  OVERHEAD.  The  correlation  between  these 
two  variables  was  calculated  and  found  to  be  very  small  for  all  the 
samples.  Correlation  studies  are  discussed  in  greater  detail  in  Sec. 

5.5  and  Sec.  5.6.  A  chi-square  distribution  with  mean  6  was  fitted  to 
the  OVERHEAD  values,  and  the  Kolmogorov- Smirnov  test  was  used  to  test 
the  fit.  A  poor  fit  was  indicated. 

A  new  variable  to  consider  at  this  point  is  (CPTIME  -  OVERHEAD), 
which  can  be  called  the  "actual  CP-mode"  time.  Referring  back  to 
Figure  ^.1,  this  is  the  time  the  system  spends  in  supervisor  mode  han¬ 
dling  CP  requests  (i.e.  handling  interrupts).  An  overall  average  of 
12$  was  obtained.  This  implies  that  one  eighth  of  the  total  time  is 
spent  handling  interrupts. 

The  low  average  OVERHEAD  value  of  6$  implies  high  efficiency  of 
the  scheduling  algorithm,  since  this  is  the  time  the  actual  scheduling 
takes  place  (see  Figure  U.l).  This  average  OVERHEAD  percentage  was 
obtained  for  an  average  of  8.3  users,  which  can  be  considered  as  a  low 
load  on  the  system.  The  constant  nature  of  the  average  value  over  all 
samples  suggests  that  this  variable  is  "user-independent"  and  an  increase 
in  the  number  of  users  may  not  affect  it  to  any  great  extent. 

5.  PROBTIME  -  (see  Table  l(e)  Appendix  C) 

This  variable  was  described  indirectly  when  the  two  variables 
CPTIME  and  WAITTIME  were  discussed,  but  the  importance  of  the  variable 
warrants  special  attention.  The  time  the  system  spends  in  the  problem 
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mode  can  be  considered  one  of  the  most  important  parameters  as  far  as 
calculating  system  efficiency  or  utilization  is  concerned.  Studying  the 
Table  1(e),  10  out  of  the  l6  samples  gave  a  value  of  around  70$ 
or  more.  Interestingly  enough  the  maximum  value  never  went  beyond  97*5$. 
Thus,  the  maximum  efficiency  obtained  from  the  system  was  97-5$.  As 
for  the  variables  CPTIME  and  WAITTIME,  the  sample  standard  deviations 
of  PROBTIME  are  large.  The  histogram  in  Fig.  HI 3  in  Appendix  D, 
indicates  the  shape  of  the  observed  frequency  distribution.  A  normal 
distribution  was  fitted  to  these  data  but  was  rejected  since  again,  the 
Kolmogorov- Smirnov  test  indicated  a  poor  fit. 

6.  PGREAD  and  PGSWAP  -  (see  Tables  l(f)  and  l(g)  Appendix  C) 

These  values  are  discussed  together  because  of  the  great  similari¬ 
ties  between  them.  An  overall  average  of  1.1  pages  per  second  read  in, 
and  1  page  per  second  swapped  out,  was  obtained.  In  sample  l4,  a  maximum 
value  of  49  pages  per  second  read  in  and  38  pages  per  second  swapped  out, 
was  found.  This  is  an  extremely  large  transfer  of  information  to  and 
from  disk  storage.  A  transfer  of  1+9  pages  (196K  bytes,  where  IK  =  1024 
bytes)  should  put  a  very  heavy  load  on  the  selector  channel  used.  Since 
a  selector  channel  transfers  312,000  bytes  per  second,  it  can  be  seen 
that  the  transfer  of  196K  bytes  would  take  0.65  seconds.  An  important 
measure  obtained  from  these  two  tables  is  the  total  number  of  pages  read 
in  plus  the  total  number  of  pages  swapped  out,  which  is  called  the 
"total  paging  load"  here.  The  maximum  paging  load,  9034  pages,  in  sample 
16  implies  that  a  maximum  (during  the  time  of  observation) of  36136K  bytes 
of  information  are  transferred  through  the  selector  channel  in  twenty 
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minutes.  Again,  using  the  data  transmission  rate  for  the  selector  channel, 
it  would  take  two  minutes  to  do  this.  Looking  at  sample  15  (3  users)  and 
sample  l6  (12  to  ill  users)  it  can  be  seen  that  a  very  low  value  of  PGREAD 
and  PGSWAP  was  obtained  for  sample  15  while  an  extremely  high  total  value 
was  obtained  for  sample  l6.  A  high  correlation  between  NUMUSERS  and 
paging  load  is  indicated.  Figures  HU  (PGREAD)  and  H5  (PGSWAP)  both 
show  a  concentration  of  values  in  the  interval  (0,2)  pages  per  second 
and  otherwise  are  nearly  alike. 

7.  PGWT  -  (see  Table  l(h)  Appendix  C) 

An  overall  average  of  only  0.8%  of  the  total  number  of  users  were 
found  in  PGWT  at  the  instant  of  observation.  The  maximum  and  minimum 
values  were  62.5%  (sample  l)  and  0%,  respectively.  Fig.  H6  indicates 
a  concentration  of  values  of  the  variable  in  the  interval  (0,2)%  and 
otherwise  very  few  extreme  values  were  observed. 

8.  IOWT  -  (see  Table  l(i)  Appendix  C) 

An  overall  average  of  2.58%  was  obtained  for  this  variable.  The 
maximum  and  minimum  values  were  100%  (sample  15)  and  0%,  respectively. 
Again,  large  values  of  the  standard  deviations  were  found.  Fig.  H7 
shows  that  over  80%  of  values  are  concentrated  in  the  interval  (0,2)%. 

Some  values  were  also  found  in  the  intervals  (8,10)%  and  (l8,20)%. 

9.  CFWT  -  (see  Table  l(j)  Appendix  C) 

On  the  average,  5.7%  of  the  total  number  of  users  were  found  to 
be  in  "console  function  wait."  The  maximum  and  minimum  values  were 
71.1*3%  (sample  4)  and  0%,  respectively.  Fig.  H8  confirms  that  the 
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variation  of  this  variable  is  very  large  indicating  the  extreme  fluctuations. 

10.  EXCFN  -  (Executing  Console  Function) 

The  value  of  this  parameter  was  always  found  to  be  zero.  The 
reason  is  that  a  user  can  only  be  executing  console  function  if  the 
system  is  in  supervisory  state.  At  the  time  of  observation,  the  virtual 
machine  CPSTATS  is  running  and  hence  the  system  is  in  the  problem  state. 

11.  EQ1  and  Q1  -  (see  Tables  l(k)  and  l(l)  Appendix  C) 

The  percentage  of  users  eligible  for  was  about  76% »  which  is 
very  much  higher  than  the  percentage  of  users  ( h%> )  actually  in  at 
the  instant  of  observation.  The  one-second  time  interval  between  obser¬ 
vations  appears  to  be  too  large.  In  order  to  obtain  a  more  realistic 
distribution  of  the  number  of  users  in  (i.e.  the  queue  length  distri¬ 
bution)  values  of  must  be  obtained  at  intervals  very  much  smaller 
than  one  second.  The  "dispatch  algorithm"  only  allows  a  user's  request 
to  remain  in  until  it  has  been  given  its  first  CPU  time-slice  of  less 
than  0.05  seconds  in  one  burst.  The  success  of  such  a  scheduling  scheme 
is  based  on  giving  good  response  to  users  whose  requests  are  satisfied 
by  one  short  burst  of  CPU  time.  In  a  console-oriented  interactive  system 
a  large  number  of  user  requests  are  of  this  type.  The  difference  between 
the  average  percentages  of  users  eligible  for  (EQl)  and  that  of  users 
in  console  function  wait  (CFWT)  was  about  70%.  This  implies  that  70% 
of  the  users  can  be  classed  as  "low  execution"  users.  This  value,  it 
must  be  remembered,  also  includes  users  who  are  logged-on  but  have  not 
requested  service.  This  is  substantiated  by  the  high  values  of  WAITTIME 
in  Table  l(c).  Over  85%  of  users  were  found  to  be  eligible  for  in 
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all  the  samples  with  high  values  of  WAITTIME.  The  maximum  value,  100$, 
occurred  in  sample  7  for  which  three  to  seven  users  were  logged-on. 
Otherwise  nearly  all  samples  showed  that  50$  or  less  of  the  users  were 
in  Q  .  The  frequencies  of  the  EQ1  and  Q1  values  were  plotted  in 
Fig.  H9  and  H10,  respectively.  Fig.  H10  indicates  that  nearly  all  Q1  ob¬ 
servations  are  in  the  interval  (0,2 )$. 

12.  EQ2  and  Q2  -  (see  Tables  l(m)  and  l(n)) 

On  the  average  about  2$  of  the  users  were  eligible  for  and 
about  17.2$  were  actually  in  Q2.  The  relatively  higher  value  of  Q2  can 
be  attributed  to  the  nature  of  user  requests  in  Q2.  They  represent 
requests  for  more  than  0.05  seconds  of  CPU  time  usually.  The  smaller 
value  of  EQ2  is  a  result  of  the  very  small  amount  of  time  spent  in  that 
state.  From  Tables  1(e),  l(m)  and  l(n),  respectively,  it  can  be  seen 
that  high  PR0BTIME  values  occur  in  the  samples  which  have  large  per¬ 
centages  of  users  in  and  eligible  to  enter  Q^.  Similarly,  the  samples 
with  low  PR0BTIME  values  correspond  to  low  Q2  and  EQ2  values.  Also, 
high  PR0BTIME  values  imply  low  values  of  WAITTIME.  Thus,  the  system 
seems  to  be  able  to  use  up  "idle  time"  by  running  a  user.  Fig. 

HI1  (Q2)  indicates  a  concentration  of  values  in  the  intervals  (l6,20)$ 
and  (36,1+0)$.  These  account  for  about  70$  of  the  values.  Fig.  H12 
(EQ2)  shows  a  concentration  of  values  in  the  interval  (0,2)$.  Some 
values  are  also  found  in  the  interval  (8,10)$. 

In  the  rest  of  this  section,  a  discussion  of  the  series  of  tables 
referred  to  as  Table  II  is  given.  Instead  of  discussing  each  table 
separately,  the  relation  of  some  of  the  system  variables  with  the  number 
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of  users  is  considered.  The  following  sample  correlation  coefficients 
were  found: 

TABLE  5.2 

CORRELATIONS  BETWEEN  THE  NUMBER  OF  USERS  AND 
CPTIME,  WAITTIME,  OVERHEAD,  AND  PGLOAD 


Variable 

Correlation 

CPTIME 

0.932 

WAITTIME 

0.002 

OVERHEAD 

0.00A 

PGLOAD 

0.951 

In  Fig.  5«1  the  mean  values  of  PGLOAD  (PGREAD  +  PGSWAP)  are  plotted 
against  the  number  of  users,  see  Table  II (a)  to  II (g).  Plotting  log^ 
(PGLOAD)  against  the  number  of  users,  see  Fig.  5-2,  nearly  produces  a 
straight  line,  which  suggests  an  exponential  relationship  between  these 
two  variables.  To  test  this  hypothesis  a  linear  regression  analysis  was 
done  on  logg  (PGLOAD)  (say,  y)  against  NUMUSERS  (say,  x).  The  following 
results  were  obtained. 

The  regression  equation  was  found  to  be 
y  =  *58x  -  U . 9 , 

where  the  standard  error  of  the  slope  is  0.0?6,  and  the  correlation  of 
x  and  y  is  O.96.  Transforming  y  to  e^,  the  PGLOAD  values  may  be 
predicted  by  the  equation 

•  58x 


PGLOAD  =  0.0075  e 
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TABLE  5.3 


NUMUSERS  AGAINST  PGLOAD 

AND  CPTIME 

NUMUSERS 

PGLOAD 

Loge( PGLOAD) 

CPTIME 

Logg (CPTIME) 

3 

0.015 

-4.20 

12.5 

2.52 

4 

0.15 

-1.90 

13.7 

2.62 

7 

0.56 

-O.58 

15.0 

2.71 

8 

1.50 

o.4i 

21.0 

3.05 

10 

2.91 

1.07 

19.1 

2.95 

11 

4-77 

1.56 

21.5 

3.07 

13 

8.50 

2.14 

25.5 

3.24 

Fig.  5.1.  Plot  of  PGLOAD  Against  NUMUSERS 


>** 

Fig.  5.P.  Plot  of  Log^  (PGLOAD)  Against  NUMUSERS 
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where  x  is  the  number  of  users.  The  observed  F-value  in  the  analysis 
of  variance  of  the  regression  was  found  to  be  58.3  with  1  and  5  degrees 
of  freedom.  The  critical  value  F  (*0l)  =  16.26  which  is  less  than 
the  observed  F-value.  This  implies  that  the  predicted  values  of  PGLOAD 
using  this  relation  produce  a  good  fit  to  the  observed  values. 


The  maximum  number  of  pages  that  can  be  transferred  through  the 
selector  channel  in  one  second  can  be  calculated  as  follows:  The  channel 
can  transfer  312,000  bytes  per  second  and  1  page  consists  of  b096  bytes; 
hence  the  number  of  pages  transferred  per  second  is 


312000 

b096 


76.2. 


Thus  assuming  that  the  selector  channel  is  available  100%  of  the  time 
for  paging,  this  maximum  value  can  be  used  to  calculate  the  correspond¬ 
ing  value  of  the  number  of  users.  Since, 

PGLOAD  =  0.0075  e°'58x. 


for  the  maximum  number  of  pages  per  second  it  can  be  seen  that 


76.2  =  0.0075  e°'58x 


0.58x 

e 


101U0, 


0.58x  =  loge  (lOlUo) 

=  9.2252 

and  x  =  15.91. 

Hence,  for  about  1 6  users,  the  channel  would  be  used  for  paging  100%  of 
the  time.  In  actual  fact  the  channel  is  not  available  100%  of  the  time 
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for  this  purpose.  A  great  amount  of  time  is  usually  spent  in  seeking 
the  required  data  from  disk  and  during  this  time  the  channel  is  inactive. 
In  actual  fact  then,  a  queue  for  the  selector  channel  service  should 
build  up  for  a  smaller  number  of  users. 

In  Table  5*3  the  mean  values  of  CPTIME  are  given  for  different 
values  of  the  number  of  users,  (also  see  Tables  II (a)  to  11(g)).  The  plot 
of  loge (CPTIME)  against  the  number  of  users,  see  Figure  5*3,  gives  a 
straight  line,  which  again  indicates  an  exponential  relationship  between 
these  two  variables.  This  hypothesis  was  tested  by  performing  a  linear 
regression  analysis  on  log^(CPTIME)  (say,  y)  and  NUMUSERS  (say,  x).  The 

following  results  were  obtained. 

The  regression  equation  was  found  to  be 
y  =  0.07  x  +  2.3^+, 

where  the  standard  error  of  the  slope  is  0.01,  and  the  correlation  of 
x  and  y  is  0.9^.  Transforming  y  to  ey,  the  CPTIME  values  can  be 
predicted  by  the  equation 

CPTIME  =  10  e°*°Tx, 

where  x  is  the  number  of  users.  The  F-value  in  the  analysis  of  variances 
of  the  regression  was  found  to  be  39-02  with  1  and  5  degrees  of  freedom. 
This  value,  again,  is  greater  than  the  critical  value  F^  <_(0.0l).  Hence 
CPTIME  values  may  be  predicted  to  give  a  good  fit  to  the  observed  values, 
using  the  above  equation. 


For  example,  for  x  -  20, 


Log  ^  (CPTIME) 


Fig.  5-3.  Plot  of  Logg  (CPTIME)  Against  NUMUSERS 
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1  1+ 

CPTIME  =  10  e 

=  h0%. 

Calculating  the  value  of  NUMUSERS  for  CPTIME  at  100%,  a  value  of  33 
users  is  obtained.  This  value  seems  unrealistic  because  it  implies 
that  with  33  users  or  more,  the  system  would  be  performing  supervisory 
functions  100%of  the  time.  It  may  be  that  the  behaviour  of  the  curve 
beyond  the  range  in  Figure  5-^  is  different  from  the  calculated  exponen¬ 
tial  shape.  It  could  have  a  maximum  for  x  <  33  users  and  remain  steady 
at  this  value.  There  is  no  way  such  a  judgement  can  be  made  with  the 
available  data.  This  could  be  one  of  the  future  projects  to  study. 

Sec.  Further  Regression  Analysis 

At  this  stage  some  of  the  expected  relationships  between  the 
variables  are  discussed.  The  first  and  foremost  relation  to  be  studied 
is  the  effect  of  the  percentage  of  users  in  and  on  the  PROBTIME 
percentages.  A  large  percentage  of  users  in  (or  eligible  for  Q^) 
would  reduce  the  system  efficiency,  unless  of  course,  there  are  enough 
users  in  "who  would  use  up  the  "idle"  time  caused  by  too  much  console 
activity.  PROBTIME  should  be  inversely  proportional  to  the  number  of 
users  in  the  wait  states,  whereas,  CPTIME  should  be  proportional  to  this 
number.  The  PAGELOAD  variable  could  perhaps  be  expressed  in  terms  of 
the  users  in  page  wait.  Stepwise  regression  analyses  were  carried  out 
to  study  the  relationships  discussed  above.  An  independent  variable 
was  allowed  to  enter  the  regression  if  it  produced  a  reduction  in  the 
error  sum  of  squares  of  more  than  1 l.  A  sample  output  of  these  analyses 
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is  given  in  Appendix  E. 

1.  PROBTIME  as  the  dependent  variable 

Regression  analysis  was  performed  separately  for  all  the  samples. 
The  percentage  of  users  eligible  for  and  in  entered  the  regression 
in  all  the  samples.  Interestingly  enough,  the  "wait-state"  variables 
never  entered  the  regression.  This  again  implies  that  a  runnable  user 
must  have  been  available  to  the  system  most  of  the  time.  At  the  1%  level, 
the  variable  Ql,  also,  did  not  enter  the  regression,  signifying  the  small 
contribution  it  makes  to  system  performance  as  far  as  high  utilization 
of  the  CPU  is  concerned.  A  high  value  (over  0.7  on  the  average)  of 
multiple  correlation  was  obtained  for  all  the  samples.  Different  values 
of  regression  coefficients  were  obtained  for  each  sample.  The  following 
relation,  from  sample  7»  indicates  the  nature  of  the  relationship: 

PROBTIME  =  b.h  ( Q2 )  +  U(EQ2)  -  0.06  [ ( Q2 ) 2  +  ( EQ2 ) 2 ]  +  10. k. 

Thus ,  a  number  of  Q2  users  are  necessary  among  all  the  users  to  run  the 
system  efficiently. 

2.  "CP  mode  time"  as  the  dependent  variable,  where  CP  mode  time  = 

(CPTIME  -  OVERHEAD) 

A  relation  between  CP  mode  time  and  the  three  main  supervisory 
functions  of  CP  was  obtained  by  regression.  These  three  functions  are: 

(i)  performing  I/O,  i.e.  the  percentage  of  users  in  I/O  wait;  (ii)  paging, 
i.e.  PGLOAD ;  and  (iii)  dealing  with  console  requests,  i.e.  percentage 
of  users  in  CF  wait.  This  relation  should  be  treated  cautiously  because 
a  low  multiple  correlation  (0.57  on  the  average)  was  obtained. 
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3.  PGREAD  as  the  dependent  variable 

A  high  correlation  (>  0.9^)  between  PGREAD  and  PGSWAP  was  calcu¬ 
lated.  The  following  relationship  was  obtained: 

PGREAD  =  c  +  b  (PGSWAP), 

where  c  was  found  to  be  0.04,  on  the  average,  and  b  was  a  little  greater 
than  1.  An  example  of  this  relationship,  from  sample  13,  is 

PGREAD  =  0.02  +  1.09  (PGSWAP). 

Thus,  the  value  of  PGREAD  was  normally  found  to  be  greater  than  that  of 
PGSWAP.  This  is  due  to  pages  being  swapped  only  if  a  request  for  a 
new  page  cannot  be  satisfied  for  lack  of  main  storage  space  to  fit  the 
requested  page  (see  Sec.  3.^).  Hence,  if  main  storage  is  not  completely 
utilized,  the  value  for  PGSWAP  will  be  less  than  that  of  PGREAD.  It 
appears  that  the  main  storage  is  usually  fully  utilized  and  that  these 
two  variables  are  equal. 

1+.  PGLOAD  as  the  dependent  variable 

The  following  variables  entered  the  regression:  (i)  PGWT,  (ii) 
Ql,  and  (iii)  CP  mode  time. 

The  entry  of  variable  Ql  implies  that  users  in  make  most  of 
the  paging  requests.  Since  Q2  did  not  enter  the  regression,  users  in 
do  not  require  a  significant  amount  of  paging.  The  entry  of  PGWT  and 
CP  mode  time  in  the  regression  was  as  expected.  A  low  value  (0.57  on 
the  average)  of  the  multiple  correlation  coefficient  was  obtained. 


A  similar  result  to  (4)  was  obtained  for  I0WT  as  the  dependent 
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variable.  Again,  users  in  make  more  I/O  requests  than  users 
in  Q2. 

Sec.  5.6.  APL  Results 

To  make  the  discussion  of  the  APL  results  more  meaningful,  refer¬ 
ence  is  made  constantly  to  the  series  of  tables  called  Table  III  in 
Appendix  C  and  to  the  histograms  in  Figures  Hill  to  H17  in  Appendix  D. 
Results  from  APLDATA  were  written  onto  tape  with  the  percentage  values 
of  TIMEUSED  and  VTOTTIME  (see  Sec.  4.U)  calculated.  The  observed  values 
of  NUMPAGES,  PRIORIT,  and  NUMDIAL  (number  of  users  dialled-on  to  the 
APL  virtual  machine)  were  written  out  on  tape  without  having  transformed 
these  values.  The  state  of  the  APL  virtual  machine  at  every  second  was 
written  on  tape  in  the  following  way:  Considering  the  eight  user-states; 
(i)  PGWT,  (ii)  IOWT,  (iii)  CFWT,  (iv)  EXCFN ,  (v)  Q2,  (vi)  Ql,  (vii)  EQ2, 
and  (viii)  EQ1,  an  eight  digit  number,  with  a  '1*  in  the  position  corre¬ 
sponding  to  the  state  of  the  virtual  machine  at  that  instant  and  ’O'  in 
every  other  position,  was  written.  Thus,  if  APL  was  in  IOWT  and  in  Ql, 
the  eight  bit  number  would  be  01000100.  Thus,  at  every  instant  of  ob¬ 
servation  each  of  the  eight  APL-state  variables  either  had  the  value 
1  or  0. 


The  sample  statistics,  mean,  standard  deviation,  maximum,  and 
minimum  values  of  NUMDIAL,  TIMEUSED,  VTOTTIME,  and  NUMPAGES,  were  calcu¬ 
lated  for  the  six  samples  (see  Tables  III (a)  to  111(d)).  The  value  of 
PRIORIT  was  found  to  be  equal  to  zero  in  all  the  samples.  Significance 
of  this  is  discussed  later  in  this  section.  The  values  of  the  APL-state 
variables  were  added  separately  for  each  sample;  divided  by  the  total 
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number  of  observations  in  the  respective  sample;  and,  then  multiplied 
by  100  to  obtain  the  total  percentage  of  time  the  APL  virtual  machine 
was  in  the  different  states  (see  Table  111(e)). 

Instead  of  discussing  each  variable  separately  a  joint  description 
is  given  here.  From  Table  Ill(a),  a  range  of  6  to  17  users  dialled  to 
the  APL  virtual  machine  was  found.  An  overall  average  of  13  users 
existed  at  the  time  of  observation.  Utilizing  the  VTOTTIME  percentages, 
the  average  percentage  of  CPU  time  used  by  the  APL  machine  was  30.1#  (see 
Table  IIl(c)).  The  supervisor  on  the  average  was  used  1.7 #  of  the  time, 
which  was  obtained  from  the  (TIMEUSED  -  VTOTTIME)  percentages.  This 
implies  that  the  APL  machine  was  performing  I/O  or  was  idle  65.2#  of  the 
total  time.  The  APL  machine,  hence,  can  be  described  as  an  "I/O  bound" 
user.  From  Table  Ill(d),  the  APL  machine  has  on  the  average  5l  pages  in 
core.  The  overall  maximum  value  of  NUMPAGES  was  found  to  be  91*  Thus, 
a  maximum  of  91  x  1  =  36lK  bytes  of  main  storage  was  used,  which  classi¬ 
fies  APL  as  a  "heavy"  core  user.  Even  the  overall  minimum  value  of  29 
pages,  in  sample  5S  occupies  116K  bytes  in  the  main  storage.  Fig.  HI 7 
shows  a  typical  plot  of  NUMPAGES  with  some  concentration  of  values  in 
the  intervals  (21,28)  pages  and  (28,32)  pages.  In  Fig.  HI 5  for  the 
variable  TIMEUSED,  the  main  concentration  of  values  was  found  in  the 
interval  (0,5)#.  A  similar  concentration  in  the  interval  (0,5)#  was 
found  for  the  variable  VTOTTIME  (see  Fig.  Hl6).  A  chi-square  distri¬ 
bution  was  tried  to  fit  the  observed  values  of  (TIMEUSED  -  VTOTTIME) 

(see  Fig.  Hi!)  but  the  fit  turned  out  to  be  poor. 


On  the  average  both  Q  and  Q2  contained  APL  virtual  machine 
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requests  33$  of  the  time  (see  Table  111(e)).  Whereas  the  APL  user  was 
eligible  for  27%  of  the  time,  it  was  eligible  for  only  7$  °f  the 
time.  The  scheduling  algorithm  was  designed  to  keep  the  APL  requests  in 
Q^,  but  this  does  not  seem  to  be  the  case  at  the  time.  Since  the  observed 
value  of  PRIORIT  (APL’s  priority  in  Q^)  (see  Sec.  is  always  zero, 

an  APL  request  does  not  seem  to  remain  in  for  long.  This  value  of 
PRIORIT  implies  that  the  APL  virtual  machine  does  not  use  0.05  seconds 
of  CPU  time  in  one  burst  while  in  Q^.  If  it  did,  a  nonzero  value  of 
PRIORIT  would  be  observed.  Hence,  the  apparent  flaw  of  the  scheduling 
algorithm  is  not  serious.  Some  of  the  users  of  the  APL  machine  may  have 
"compute-bound"  programs  which  are  forced  into  by  using  0.05  second 
in  one  burst,  while  in  Q^.  From  Table  Ill(e)  it  can  be  seen  that  on 
the  average,  the  percentages  of  time  spent  in  PGWT  and  IOWT  are  0.75, 

3.3.  Whereas  the  values  of  CFWT  and  EXCFN  could  not  be  found  by  the 
method  user.  The  low  percentages  may  be  attributed  to  the  length  of  the 
time  interval  between  observations.  Another  reason  for  these  small 
values  may  be  that  the  360/APL  system  schedules  the  execution  of  its 
"ready"  users  while  another  of  its  users  is  in  one  of  the  above  wait 
states . 

From  the  above  results,  the  APL  virtual  machine  can  be  charac¬ 
terized  as  follows:  The  APL  requests  are  console  oriented  (27$  of  the 
time  in  the  EQ1  state),  I/O  bound  (only  3^.8$  of  the  time  are  the  CPU  and 
the  CP  supervisor  used)  and  core  storage  demanding  (on  the  average  5^ 
pages  are  in  core,  which  is  equivalent  to  212K  bytes  of  main  storage). 

The  final  conclusion,  an  assessment  of  the  achievements  of  these  models 
and  guidelines  for  further  research,  are  discussed  in  the  next  chapter. 
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CHAPTER  VI 


CONCLUSION  AND  SUGGESTIONS  FOR  FURTHER  STUDIES 

The  main  aim  of  this  study  was  to  obtain  some  kind  of  evaluation 
measures  of  the  time-sharing  system,  CP/67.  One  of  the  measurements 
obtained  was  the  average  CPU  utilization  (PROBTIME)  value  of  about  55% 
whereas  the  maximum  CPU  utilization  was  found  to  be  about  97.5%.  A 
synopsis  of  the  system  performance  as  observed  during  the  specified 
data- gathering  period  is  given  below: 

TABLE  6.1 

SUMMARY  OF  SYSTEM  PARAMETERS 


Percentage 
of  time  in 

Overall 

Average 

10  of  16 

samples  with 
high  PROBTIME 

Samples  with 
minimum 
WAITTIME 

Highest 

number 

of  Users 

APL 

Machine 

CP  mode 

12.0 

12.8 

13.6 

18.1 

4.7 

Overhead 

6.0 

6.9 

7.3 

7.1 

- 

Probtime 

55.0 

73.0 

78.6 

70.2 

30.1 

Wait time 

27.0 

7.3 

0.5 

4.6 

65.2 

Number  of 

Users 

8.3 

8.1 

10.2 

12.5 

12.9 

From  Table  6.1,  it  is  unrealistic  to  expect  an  average  problem 


time  to  be  greater  than  79%.  To  discuss  the  total  overhead  of  the 
supervisor  both  the  interrupts  charged  to  the  respective  virtual 
machines  (CP  mode)  and  the  actual  recorded  overhead  must  be  added.  The 
above  measures  only  describe  CPU  time  utilization  without  considering 
the  core  utilization.  An  overall  average  of  2  pages  were  read  in  and 
swapped  out  every  second.  Just  the  fact  that  the  pages  were  swapped 
every  second  implies  that  the  main  storage  was  used  to  capacity.  In 
Sec.  5.1+  it  was  found  that  PGLOAD  increases  exponentially  with  an 
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increasing  number  of  users.  In  reporting  system  performance  it  must 
be  realized  that  these  measurements  are  user  dependent. 

The  "user  states"  parameters  describe  the  general  behavior  of 
users  or  try  to  characterize  the  user  population.  The  console  oriented 
nature  of  the  users  is  very  evident  through  the  usually  high  values  of 
EQ1  (see  Sec.  5.4).  Considering  the  regression  analysis  in  Sec.  5*5  it 
was  found  that  the  users  in  do  not  reach  wait-states  as  often  as  the 
users  in  Q  and  hence,  may  be  classed  as  "long  execution"  users.  An 
overall  average  of  17$  of  the  users  were  of  this  type.  Another  user 
characteristic  appears  to  be  that  a  fairly  high  percentage  of  them  sit 
at  their  terminals  "thinking"  and  force  the  system  into  a  wait-state. 

The  scheduling  algorithm  should  not  be  blamed  for  this.  The  high 
average  values  of  WAITTIME  appear  to  be  entirely  due  to  the  users. 

From  Table  6.1  it  is  clear  that  APL  applies  a  very  heavy  load  on 
the  CP  system.  Problem  mode  time  of  30.1$  is  approximately  half  of  the 
total  CP  problem  mode,  and  since  the  range  of  average  values  is  from 
10%  to  5U$  there  are  some  cases  when  it  is  well  over  half.  Moreover, 

APL  was  a  heavy  user  of  core  storage  using  an  average  of  212K  bytes 
with  a  range  from  ll6K  to  364K  bytes . 

One  of  the  most  important  achievements  of  this  study  is  the 
method  used  to  obtain  the  data.  This  method  is  unique  in  that  it  does 
not  involve  making  any  internal  changes  to  the  system.  The  load  imposed 
by  the  data-gathering  virtual  machine  was  similar  to  the  load  by  any 
other  user  and  therefore  did  not  bias  the  results  significantly.  Thus, 
the  method  seems  to  be  an  ideal  one  to  use  for  further  research. 
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Let  it  be  stressed  that  a  group  of  users  classed  as  "the  average 
user"  is  considered  not  to  exist.  The  only  solution  that  seems  possible 
is  to  smooth  the  series  of  observations  by  attaching  "weights"  according 
to  the  user -population  they  are  obtained  from.  An  extensive  study  of 
the  "type"  of  lasers  should  be  carried  out  to  classify  them  in  different 
groups  and  associate  "weights"  according  to  the  demands  they  make  on  the 
system.  For  example,  I/O  bound  and  compute-bound  users  can  be 
classified  in  such  a  way.  It  might  even  be  necessary  to  divide  compute- 
bound  users  into  subclasses  according  to  their  CPU  time  requirements. 

This  would  allow  a  selection  of  user  classes  which  could  be  used  as 
bench  marks  in  future  system  evaluation.  Using  the  overall  average 
problem  time  of  55%  with  7  CP  users  and  average  APL  problem  time  of 

30.1%  with  13  users  from  Table  6.1,  it  is  conjectured  that,  on  the  aver¬ 

age,  25%  of  the  problem  time  was  spent  servicing  the  7  CP  users.  If  it 
is  assumed  that  the  problem  time  cannot  exceed  75%  it  would  appear  that 
13  CP  users  and  the  APL  machine  (with  13  users)  would  produce  the  max¬ 
imum  utilization  (highest  problem  time).  If  there  are  only  CP  users 
then  it  is  conjectured  that  the  maximum  utilization  would  be  obtained 
with  21  users;  if  there  are  only  APL  users  it  is  conjectured  that  the 
maximum  would  occur  with  31  users.  For  a  large  system  like  360/67  to 

support  only  21  CP  users  or  31  users  of  a  single  language,  it  is  clearly 

very  expensive. 

Finally  to  sum  up  the  achievements  of  this  project  it  is  suggested 
that  an  important  new  method  of  obtaining  data  is  discovered.  Various 
performance  measures  describing  the  system  are  obtained,  which  are  use¬ 
ful  in  evaluating  the  time-sharing  system  CP/67. 
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APPENDIX  C 


The  series  of  tables,  Table  l(a)  to  Table  l(n), gives  the  sample 
statistics  of  each  variable  in  every  sample.  The  number  of  observations 
varies  between  1000  and  1200  observations.  The  series  of  tables,  Table 
Il(a)  to  Table  Il(g),  gives  the  mean  values  of  these  variables  for 
different  values  of  the  number  of  users.  Each  value  is  from  a  subsample 
with  greater  than  100  observations.  The  series  of  tables,  Table  Ill(a) 
to  Table  Ill(d), gives  the  sample  statistics  for  the  APL  variables.  The 
number  of  observations  varies  from  approximately  550  to  700  observations. 
The  APL  virtual  machine  states  are  tabulated  in  Table  Ill(e). 
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TABLE  1(a) 

NUMUSERS:  THE  NUMBER  OF  VIRTUAL  MACHINES  LOGGED  ON 


Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

8.45 

0.78 

10.00 

8.00 

2 

8.46 

0.70 

12.00 

8.00 

3 

10.20 

0.45 

11.00 

9-00 

4 

7.12 

0.84 

9.00 

5.00 

5 

T  •  35 

0.69 

10.00 

7.00 

6 

9.92 

o.4o 

12.00 

9.00 

T 

4.48 

1.21 

7.00 

3.00 

8 

8.03 

0.19 

9.00 

8.00 

9 

8.28 

1.95 

15.00 

6.00 

10 

8.35 

0  .66 

9.00 

7.00 

11 

12.45 

0.59 

15.00 

11.00 

12 

7.60 

0.52 

9.00 

7.00 

13 

6.15 

0.36 

7.00 

6.00 

l4 

10.63 

0.65 

12.00 

9.00 

15 

3.00 

0.08 

4.00 

3.00 

l6 

12.85 

0.4l 

l4  .00 

12.0  0 

Overall  Average  =  8.3  users 
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TABLE  1(b) 


CPTIME: 

THE  PERCENTAGE 

OF  TIME  THE  SYSTEM 

WAS  IN  CP 

MODE 

Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

26.3k 

13.38 

83.98 

3.00 

2 

15-35 

9.02 

67.00 

1.00 

3 

20.89 

9.67 

51.98 

3.49 

4 

21.89 

13.30 

65.01 

1.50 

5 

15-36 

8.45 

58.98 

1.50 

6 

14.20 

9.64 

67.02 

1.00 

T 

13.63 

6.84 

43.02 

1 .50 

8 

22.70 

9.02 

53.00 

4.00 

9 

9.02 

9.24 

75-00 

1.00 

10 

16.10 

8.23 

46.02 

3.49 

11 

25.58 

14.89 

87.00 

2.00 

12 

21.26 

9.46 

74.00 

3.00 

13 

7.12 

4.83 

51.00 

1.00 

Ik 

21.64 

10.51 

80.98 

3.00 

15 

12.17 

6.45 

53.00 

2.00 

16 

25.18 

11.49 

100.00 

4.50 

Overall 

Average  = 

18  .  0  356 

of  total  time 

, 
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TABLE  1(c) 

WAITTIME:  THE  PERCENTAGE  OF  TIME  THE  SYSTEM  WAS  IN 

"IDLE”  STATE 


Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

k.ok 

12.21 

100.00 

0.00 

o 

c. 

69.33 

32.91 

100.00 

0.00 

3 

0.^3 

2.77 

60.01 

0.00 

4 

46.21 

36.07 

100.00 

0.00 

5 

14.52 

29.85 

100.00 

0.00 

6 

53.91 

41.50 

100.00 

0.00 

7 

18.26 

33.75 

100.00 

0.00 

8 

1.79 

3.98 

33.98 

0.00 

9 

85  •  66 

24.58 

100.00 

0.00 

10 

2.87 

13.18 

100.00 

0.00 

11 

41.12 

36.25 

100.00 

0.00 

12 

6.95 

17.84 

100.00 

0.00 

13 

63.16 

44.00 

100.00 

0.00 

l4 

7.58 

21.27 

100.00 

0.00 

15 

11 . 70 

28.16 

100.00 

0.00 

16 

4.58 

7.47 

43.02 

0.00 

Overall  Average  =  27.00#  of  total  time 
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TABLE  1(d) 


OVERHEAD : 

THE  PERCENTAGE 

OF  TIME  THE  SYSTEM 

SPENT  IN 

OVERHEAD 

Sample  Humber 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

9-79 

5.09 

25.98 

1.00 

2 

k.2k 

2.47 

16.00 

0.00 

3 

7-30 

3.34 

20.00 

1.50 

4 

6.77 

4.88 

27.00 

0.00 

5 

5A6 

2.86 

18.01 

0.00 

6 

4.22 

2.66 

17.00 

0.50 

7 

5.86 

3.09 

17.02 

0.00 

8 

7-56 

2.87 

20.00 

1.00 

9 

2.14 

1.98 

18.00 

0.00 

10 

5.81 

2.77 

17.99 

1.00 

11 

7.25 

4.25 

30.00 

0.00 

12 

7.87 

1.11 

23.00 

1.00 

13 

2.64 

1.81 

16.00 

0.00 

14 

6. 86 

3.12 

26.00 

0.50 

15 

4.87 

2.56 

17.02 

0.00 

1 6 

7.14 

3.09 

26.00 

1.50 

Overall 

Average  = 

6.00  % 

of  total  time 

✓ 
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TABLE  1(e) 

PROBTIME:  THE  PERCENTAGE  OF  TIME  THE  SYSTEM  WAS  IN  PROBLEM  MODE 


Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

69.60 

19.43 

97-00 

0.00 

2 

15.30 

31.03 

97-00 

0.00 

3 

78.56 

10.70 

96.51 

8.98 

4 

31.88 

32.83 

95.00 

0.00 

5 

70.10 

29.81 

97-00 

0.00 

6 

31.88 

39.70 

97.00 

0.00 

7 

68.09 

33.53 

97.01 

0.00 

8 

75.49 

11.27 

96.00 

0.00 

9 

5.30 

20.91 

95.00 

0.00 

10 

81.00 

15.91 

96.51 

0.00 

11 

33.28 

35*42 

96.00 

0.00 

12 

71.76 

19.93 

96.51 

0.00 

13 

29.70 

42.75 

97.50 

0.00 

l4 

70.76 

22.54 

96.00 

0.00 

15 

76.ll 

28.24 

97-51 

0.00 

16 

70.22 

14.76 

95.50 

0.00 

Overall  Average  =  54.96%  of  total  time 
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TABLE  1(f) 

PGRD :  THE  NUMBER  OF  PAGES  READ  IN  PER  SECOND 


Mean 

Standard 

Deviation 

Total  in 

20  Minutes 

Maximum 

Minimum 

0.70 

1.91 

71+7 . 82 

18.00 

0.00 

0.57 

1.68 

610.50 

20.00 

0.00 

1.88 

3.51 

2,052.82 

33.00 

0.00 

0.66 

1.92 

716.50 

16.00 

0.00 

0.7*+ 

2.13 

808.50 

26.00 

0.00 

1.18 

2.88 

1,160.83 

35.00 

0.00 

0.06 

0.50 

68.50 

9.00 

0.00 

1.52 

3.38 

1,685.16 

1+0.00 

0.00 

0.77 

2. 3U 

899 . 50 

27.00 

0.00 

0.1+0 

1.31 

1+21+.50 

13.00 

0.00 

3.1+2 

5.62 

3,81+0.66 

1+8.00 

0  .00 

0.31+ 

1.11 

1+07.50 

11.00 

0.00 

0.1+1+ 

1.1+2 

1+91.83 

11.00 

0.00 

2.1+2 

1+.65 

2,671+. 50 

1+9.00 

0.00 

0.01 

0.12 

15.00 

2.00 

0  .00 

U.l+7 

5.1+8 

1*  ,803. 50 

1+5.00 

0.00 

Overall  Average:  1.1  pages  per  second 
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TABLE  1(g) 

PGSWP:  THE  NUMBER  OF  PAGES  SWAPPED  (READ  OUT)  PER  SECOND 


Mean 

Standard 

Deviation 

Total  in 

20  Minutes 

Maximum 

Minimum 

0.56 

1.55 

598.16 

15.00 

0.00 

0.51 

1.59 

5U7.OO 

19.00 

0.00 

1.69 

3.15 

1,81+2.82 

30.00 

0.00 

0.1+2 

1.1+9 

1+53.00 

16.00 

0.00 

0.61+ 

1.87 

698.50 

25.00 

0.00 

1.00 

2 .50 

985.33 

28.00 

0.00 

0.03 

0.31+ 

33.50 

9.00 

0.00 

1.32 

2.98 

1,1+58.83 

37.00 

0  .00 

0.59 

1.99 

689.50 

21+.00 

0.00 

0 .31+ 

1.19 

363.00 

12.00 

0.00 

2.8l 

U.77 

3,156.16 

36.00 

0  .00 

0.29 

0.98 

351+.50 

10.00 

0.00 

0.38 

1.27 

1+33.83 

10.00 

0.00 

2.06 

3.96 

2,285.00 

38.00 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

3.9^ 

U .  87 

1+ ,231. 50 

31+.00 

0.00 

Overall  Average:  1.04  pages  per  second 
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TABLE  1(h) 

PGWT:  THE  PERCENTAGE  OF  USERS  IN  PAGE  WAIT 


Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

1.61 

6.08 

62.50 

0.00 

2 

0.39 

2.38 

28.57 

0.00 

3 

0.78 

2.89 

22.22 

0.00 

1+ 

0.30 

2.27 

28.57 

0.00 

> 

0.38 

2. hO 

16.67 

0.00 

6 

0 .  U6 

2.3h 

25.00 

0.00 

7 

0.20 

2.58 

50.00 

0.00 

8 

0.89 

3.56 

28.57 

0.00 

9 

0.6l 

5.04 

33.33 

0.00 

10 

0. 24 

1.98 

28.57 

0.00 

11 

1.37 

4.01 

41.67 

0.00 

12 

0.88 

3.77 

33.33 

0.00 

13 

0.39 

2.85 

40.00 

0.00 

l4 

1.18 

4.11 

60.00 

0.00 

15 

0.08 

2.11 

50.00 

0.00 

16 

1.96 

4.46 

33.33 

0.00 

Overall  Average  =  0.8%  of  users 


v 
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TABLE  I(i) 

IOWT:  THE  PERCENTAGE  OF  USERS  IN  I/O  WAIT 


Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

5.30 

8.59 

42.86 

0.00 

2 

1.78 

4.70 

25.00 

0.00 

3 

3.21 

6.26 

4o .  00 

0.00 

4 

it. 06 

8.17 

42.86 

0.00 

5 

1.74 

5.16 

37.50 

0.00 

6 

0.98 

3.30 

22.22 

0.00 

7 

1.66 

7.54 

50.00 

0.00 

8 

5.26 

9.28 

42.86 

0.00 

9 

0.91 

3.67 

28.57 

0.00 

10 

1.73 

4.79 

28.57 

0.00 

11 

2.06 

4.60 

27.27 

0.00 

12 

3.49 

7.32 

42.86 

0.00 

13 

0.44 

2.92 

20.00 

0.00 

lit 

3.00 

6.32 

50.00 

0.00 

15 

2.85 

11.80 

100.00 

0.00 

16 

2.78 

5.26 

25.00 

0.00 

Overall  Average  =  2.5$%  of  users 
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TABLE  I(j) 

CFWT:  THE  PERCENTAGE  OF  USERS  IN  CONSOLE  FUNCTION  WAIT 


Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

12.51 

11.45 

42.86 

0.00 

2 

5-17 

7.29 

36.36 

0.00 

3 

2.88 

5.35 

22.22 

0.00 

1+ 

18.60 

16.97 

71.43 

0.00 

5 

1.37 

4.16 

22.22 

0.00 

6 

3.80 

6.02 

22.22 

0.00 

7 

3.82 

9.39 

40.00 

0.00 

8 

2.40 

6.02 

28.57 

0.00 

9 

9.06 

13.79 

66 . 67 

0.00 

10 

8.00 

8.90 

28.57 

0.00 

11 

4.04 

5.43 

25.00 

0.00 

12 

5-92 

7.68 

37.50 

0.00 

13 

2.3b 

6.21 

33.33 

0.00 

l4 

8.61 

8.53 

40.00 

0.00 

15 

0.26 

2.97 

33.33 

0.00 

l6 

2.81 

5.02 

16.67 

0.00 

Overall  Average  =  5-7$  of  users 
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TABLE  l(k) 

EQ1:  THE  PERCENTAGE  OF  USERS  ELIGIBLE  TO  ENTER  Q 


Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

6b.  69 

14.12 

100.00 

12.50 

2 

91.1b 

9.90 

100.00 

57.14 

3 

6b.  80 

11.70 

90.00 

33.33 

4 

81.20 

15.86 

100.00 

40.00 

5 

76.57 

11.62 

100.00 

33.33 

6 

89.26 

9.19 

100.00 

50.00 

7 

71.85 

16.69 

100.00 

0.00 

8 

62.91 

12.86 

100.00 

28.57 

9 

90.39 

10.57 

100.00 

50.00 

10 

76.02 

10.83 

100.00 

16.67 

11 

86.62 

11.08 

100.00 

25.00 

12 

75.18 

12.78 

100.00 

33.33 

13 

86.47 

11.01 

100.00 

40.00 

lb 

73.87 

l4. 91 

100.00 

10.00 

15 

50.96 

20.13 

100.00 

0.00 

16 

76.54 

8.33 

100.00 

41.67 

Overall  Average  =  76.32%  of  users 
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TABLE  1(1) 

Ql:  THE  PERCENTAGE  OF  USERS  IN  Q 


Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

3.61 

7.70 

55.56 

0.00 

2 

k.6l 

7.59 

42.86 

0.00 

3 

2. 77 

5.51 

33.33 

0.00 

4 

6.77 

9.56 

50.00 

0.00 

5 

3.34 

6.78 

37.50 

0.00 

6 

5.38 

6.91 

37.50 

0.00 

7 

1.49 

7.09 

100.00 

0.00 

8 

4.63 

7.75 

42.86 

0.00 

9 

7.72 

8.79 

37.50 

0.00 

10 

1.59 

4.82 

50.00 

0.00 

11 

5.39 

6.69 

41.67 

0.00 

12 

3.86 

7.61 

57-14 

0.00 

13 

6.29 

9.51 

40.00 

0.00 

lk 

4.31 

6.97 

40.00 

0.00 

15 

2.76 

11.43 

50.00 

0.00 

16 

6.18 

7.05 

41.67 

0.00 

v'  . 

Overall  Average  =  4.42 l  of  users 
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TABLE  I(m) 

EQ2:  THE  PERCENTAGE  OF  USERS  ELIGIBLE  TO  ENTER  Q2 


Sample  Number  Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

2.1+1+ 

5. 31+ 

28.57 

0.00 

2 

0.50 

2.60 

Ik.  29 

0.00 

3 

1.79 

1+.13 

22.22 

0.00 

U 

0.99 

3.99 

20.00 

0.00 

5 

2.21+ 

5.71 

33.33 

0.00 

6 

0.71 

2.72 

12.50 

0.00 

7 

5.59 

12.89 

50.00 

0.00 

8 

2.50 

5.53 

28.57 

0.00 

9 

0.12 

1.33 

16.67 

0.00 

10 

2.13 

5.31+ 

33.33 

0.00 

11 

0.75 

2.1+8 

16.67 

0.00 

12 

2.1+7 

5.68 

28.57 

0.00 

13 

0.76 

3.83 

20.00 

0.00 

lU 

1.1+5 

3.67 

20.00 

0.00 

15 

'7.51 

18.22 

100.00 

0.00 

l6 

1.18 

3.00 

18.18 

0.00 

Qyerall  Average  =  2.08%  of  users 
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TABLE  l(n) 


Q2 :  THE 

PERCENTAGE  OF  USERS 

IN  Q2 

Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

29.2k 

14.17 

71.43 

0.00 

2 

3.13 

6.33 

28.57 

0.00 

3 

30.62 

12.01 

55.56 

0.00 

1+ 

11.01 

11.82 

57.14 

0.00 

5 

17.83 

11.79 

50.00 

0.00 

6 

4.62 

6.21+ 

33.33 

0.00 

7 

21.05 

18.39 

66. 67 

0.00 

8 

29-93 

12.23 

57.14 

0.00 

9 

1.75 

1+.74 

25.00 

0.00 

10 

20.2k 

10.93 

50.00 

0.00 

11 

7.21 

8.32 

41.67 

0.00 

12 

18.1+6 

11.85 

50.00 

0.00 

13 

6.1+6 

9.61+ 

40.00 

0.00 

l4 

20.35 

13.93 

60.00 

0.00 

15 

38.75 

24.83 

100.00 

0.00 

16 

l6.08 

6.95 

36.36 

0.00 

Overall  Average  =  17.3%  of  users 
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TABLE  111(a) 

NUMDIAL:  THE  NUMBER  OF  USERS  DIALLED 
ON  TO  THE  APL  VIRTUAL  MACHINE 


Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

16.00 

0.00 

16.00 

16.00 

2 

16.56 

0.58 

17.00 

15.00 

3 

1I+.9I+ 

0.76 

16.00 

lit.  00 

1+ 

7.50 

0.86 

10.00 

6.00 

5 

10.98 

0.90 

12.00 

9.00 

6 

11.26 

0.1+1+ 

12.00 

11.00 

Overall  Average  = 

12.87  users 

TABLE  111(b) 

TIMEUSED 

MACHINE 

:  THE  PERCENTAGE  OF  TIME 
WAS  IN  PROBLEM  MODE  OR  CP 

THE  APL 

MODE 

Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

37. 27 

1+3.37 

100.00 

0.20 

2 

53.37 

31.78 

100.00 

0.98 

3 

18.1+5 

21.35 

99.1+0 

0.70 

1+ 

19.93 

31.73 

100.00 

0.30 

5 

23.12 

32.31+ 

100.00 

0.29 

6 

56.70 

1+3.36 

100.00 

0.29 

Overall 

Average  = 

3l+ . 8^  of  total  time 

\ 
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TABLE  IIl(c) 

VTOTTIME:  THE  PERCENTAGE  OF  TIME  THE 
APL  MACHINE  WAS  IN  PROBLEM  MODE 


Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

33.94 

43.01 

100.00 

0.00 

2 

45.96 

32.22 

100.00 

0.00 

3 

10.51 

20.19 

98.40 

0.00 

4 

16.95 

31.23 

100.00 

0.00 

5 

19.65 

31.68 

100.00 

0.00 

6 

53.58 

43.18 

100.00 

0.00 

Overall  Average  = 

30.1%  of  total  time 

TABLE  Ill(d) 

NUMPAGES:  THE  NUMBER  OF 
THE  APL  MACHINE  HAD  IN 

PAGES 

CORE 

Sample  Number 

Mean 

Standard  Deviation 

Maximum 

Minimum 

1 

48.97 

2.81 

52.00 

39-00 

2 

.  50.82 

2.13 

52.00 

32.00 

3 

61.75 

10.19 

91.00 

48.00 

4 

58.74 

13.35 

90.00 

39.00 

5 

50.25 

2.81 

52.00 

29.00 

6 

52.00 

0.00 

52.00 

52.00 

Overall  Average  =  53. T  pages 
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APPENDIX  D 


Histograms  in  Figure  HI  to  H13  represent  the  frequency  plots  of 
the  variables  CPTIME,  WAITTIME  (low  value),  WAITTIME  (high  value), 
OVERHEAD,  PGREAD,  PGSWAP,  PGWT,  IOWT ,  CFWT,  EQ1,  Ql,  Q2,  EQ2,  and 
PROBTIME,  respectively.  Every  tenth  value  of  the  variable  was  used. 

The  sample  output  given  here  is  from  sample  number  3.  The  high  values 
of  WAITTIME  are  plotted  from  sample  number  9*  Histograms  in  Figure  HI 4 
to  HIT  are  the  frequency  plots  of  the  "APL  virtual  machine"  variables 
(TIMEUSED  -  VTOTTIME),  TIMEUSED,  VTOTTIME,  and  NUMPAGES.  In  this  case, 
every  fifth  value  of  the  variables  is  plotted.  The  output  in  this 
appendix  is  from  sample  number  h. 
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Fig.  H2(a).  Histogram  of  Low  Values  of  WAITTIME 


Fig.  H3.  Histogram  of  0VF1RHEAD 
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Fig.  H';>.  Histogram  of  PGSWAP 


IOWT  Percentages 


Fig.  HT*  Histogram  of  IOWT 


Fig.  H9-  Histogram  of  EQ1 
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Fig-  H10.  Histogram  of  Ql 


Q2  Percentages 


Fig-  Hll.  Histogram  of  Q2 


Fig.  H13.  Histogram  of  PROBTIME 


Frequency 


Fig.  HlU.  Histogram  of  (TIMEUSED-VTOTTIME) 


Frequency 


TIMEUSED  Percentage 
Fig.  H15.  Histogram  of  TIMEUSED 
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Fig.  H17-  Histogram  of  NUMPAGFS 


APPENDIX  E 


A  sample  output  of  the  four  regression  results  discussed  in 
Section  5-5  is  given  here.  The  following  32  variables  were  used. 


1 

PGRD 

2 

(CPTIME- OVERHEAD) 

3 

WAITTIME 

1+ 

OVERHEAD 

5 

(PGRD  +  PGSWP ) 

6 

PGSWP 

T 

PGWT 

8 

IOWT 

9 

CFWT 

10 

EXCFN 

11 

Q2 

12 

Q1 

13 

EQ2 

Ik 

EQ1 

15 

PROBTIME 

16 

( CPTIME-OVERHEAD ) 

IT 

(EQ1-CFWT) 

18 

(EQ1-CFWT)2 

19 

(CPTIME)2 

20 

(WAITTIME)2 

21 

(OVERHEAD)2 

22 

(PGRD)2 

13b 
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23 

(PGSWP)2 

2k 

( PGWT ) 2 

25 

( I OWT ) 2 

26 

( CFWT ) 2 

27 

( EXCFN ) 2 

28 

(Q2)2 

29 

(Ql)2 

30 

(EQ2)2 

31 

( EQ1 ) 2 

32 

( PROBTIME ) 

The  four  dependent  variables  chosen  were  PROBTIME,  (CPTIME  - 
OVERHEAD),  PGRD ,  and  (PGRD  +  PGSWP).  The  first  result  is  from  Sample 
7,  with  1108  observations,  and  the  remaining  three  results  are  from 
Sample  13  with  1117  observations.  In  all  cases  one  variable  was  chosen 
as  the  dependent  variable  and  the  rest  were  considered  as  the  indepen¬ 
dent  variables.  In  some  obvious  cases,  some  variables  were  deleted 
before  the  regression  analysis  was  carried  out.  For  example,  when 
(PGRD  +  PGSWP)  was  the  dependent  variable,  PGRD  and  PGSWP  were  both 
deleted.  An  independent  variable  was  allowed  to  enter  the  regression 
only  if  the  reduction  in  the  error  sum  of  squares  attributed  to  that 
variable  was  greater  than  one  per  cent  of  the  total  sum  of  squares. 
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